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Preface 


This book was written to be a textbook for seniors or first-year graduate 
students interested in electrical communication systems. The book departs 
from the usual format in two major respects. First, the text does not cover 
analog or pulse modulation systems. Only digital communication systems 
are discussed. The clear trend in today’s academia is to teach less analog 
and more digital material. I believe we shall soon see the day when the 
principal communication courses are all digital. Analog subjects, if taught 
at all, will be in a course separate from that covering the digital systems. 
I hope the book will provide good service to the trend to all-digital courses. 

The second way the book departs from the usual format is the omission 
from the main text of the topics of deterministic signals (Fourier series, 
Fourier transforms, etc.), networks (transfer functions, bandwidth, etc.), 
and random signals and noise. The reason is that I believe these topics 
should no longer be a part of a communication course. There are enough 
modern communication subjects to be covered without having to include 
these subjects too. However, for students whose backgrounds are a bit 
rusty or for instructors who still include some of these subjects, I have 
provided very succinct reviews in Appendixes A and B. The only other 
student background required to use the book is that typical of senior electrical 
engineering students. 

Content of the book has been selected to be adequate for a one- 
semester course in communication theory. By selective omission (Af-ary 
subjects in Chapter 6, for example) the content can match a course of one 
quarter length. Both courses presume three hours per week of classroom 
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Preface 


exposure. A good preview of the book’s content can be obtained from 
either the table of contents or from reading the short SUMMARY AND 
DISCUSSION sections at the ends of Chapters 2 through 6. 

Because the text is intended to be a textbook, a large number of 
problems is included (over 370 including appendixes). The more-advanced 
or lengthy problems and text sections are keyed by a star (★). A complete 
solutions manual is available to instructors from the publisher. 

Several people have helped to make this book possible. Dr. Leon W. 
Couch, II, of the University of Florida and Dr. D. G. Daut of Rutgers 
University read the full text and made many valuable improvements. Mr. 
Mike Meesit independently worked many of the problems and suggested 
improvements. D. I. Starry cheerfully typed the entire manuscript in its 
several variations, and the University of Florida made her services available. 
To these I extend warmest thanks. I am also grateful to Addison-Wesley 
Publishing Company for permission to reproduce freely material from Chapter 
7 of my earlier book, Communication System Principles (1976). Finally, 
my wife, Barbara, provided invaluable assistance in proofreading, and her 
efforts are deeply appreciated. 

Gainesville, FL Peyton Z. Peebles, Jr. 


Chapter 1 


Introduction 


This book is concerned with the transfer of information from one point to 
another by use of digital communication ( digicom )t systems. Although we 
subsequently define digicom systems more precisely, we broadly define 
them as systems that convey information by using only a finite set of discrete 
symbols. Printed English text is an example of such symbols. Here a finite 
set of symbols (26 letters, space, numbers 0 to 9, some special characters, 
and various punctuation symbols) is used to convey information to a reader. 


1.1 SOME HISTORY 

The use of digital methods to convey information is not new. Some of the 
concepts we consider to be very modern and state of the art today were 
actually in existence over 380 years ago [1]. One of the earliest digicom 
techniques was due to Francis Bacon (1561-1626), an English philosopher. 
In 1605 Bacon developed a two-letter alphabet that could be used to represent 
24 letters* of the usual alphabet by five-letter “words” using the two basic 
letters. Figure 1.1-1 illustrates the representations for the first few alphabet 
letters. In today’s terminology the five-letter “words” would be called 
codewords, and the collection of all these codewords would be called a 

t It’s hoped that most will forgive the author's minor abuse of the language in coining 
this new word. 

i Dr. M. New, English Department, University of Florida, has informed the author 
that letters j and v were missing in the early English alphabet. 
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Sec. 1.1 Some History 


A = a a a a a 
B = a a a a b 
C = a a a b a 
D = a a a b b 
E = a a b a a 
F = a a b a b 
G = a a b b a 


H = a a b b b 
/ = a b a a a 
K = a b a a b 
L = a b a b a 
M = a b a b b 
N = a b b a a 
O = a b b a b 


(more than enough for the 24 for which it was intended). The codes using 
three and five basic letters may represent as many as 3 3 = 27 and 5 2 = 
25 alphabet letters, respectively. 

In an apparently independent development in 1703, Gottfried Wilhelm 
Leibniz, a German mathematician, described a binary code using only the 
two numbers 0 and 1 to represent integers, as illustrated in Fig. 1.1-3, 
Leibniz’s binary code seems to be the earliest forerunner of today’s natural 
binary code (Chap. 3), where we use binary digits 0 and 1. By replacing 
the ordered letters A, B, . . ., Z by integers and letters a and b by 0 and 
l,t respectively, we see that the Bacon and Leibniz codes are equivalent. 


Figure 1.1-1. Francis Bacon's use of two basic letters to represent alphabet 
letters by five-letter codewords (circa 1605). (Adapted from [1], © 1983 IEEE, 
with permission.) 

code. Since only two basic letters are used, it would be called a binary 
code. 

In 1641, shortly after Bacon’s death, John Wilkins (1614-1672), a 
theologian and mathematician, showed how codewords could be made shorter 
by using more basic letters [1]. Figure 1.1-2 illustrates Wilkins’ two-, three- 
and five-letter codewords. In present-day terminology these three codes 
would be called M - ary codes (M = 2, 3, and 5) of respective lengths 5, 3, 
and 2. Because the binary code has two basic letters per position and five 
positions per codeword, it can represent as many as 2 5 = 32 alphabet letters 


Figure 1.1-3. Leibniz’s binary code- 
words representing integer numbers 
(circa 1703). (Adapted from [1], © 
1983 IEEE, with permission.) 


Integer 

Codeword 

0 

0 

2 

1 0 

3 

1 I 

4 

1 0 0 

5 

1 0 1 

6 

I 1 0 

7 

I 1 1 

8 

10 0 0 


Alphabet Letter 

Codewords Using Basic Letters Indicated 

a, b 

a, b, c 

a , b, c, d , e 

A 

a a a a a 

a a a 

a a 

B 

a a a a b 

a a b 

a b 

L 

a b a b a 

ebb 

c a 

M 

a b a b b 

c b c 

c b 

Y 

b a b b a 

b c b 

e c 

Z 

b a b b b 

b c c 

e d 


The principal practical uses of the digital codes of Bacon, Wilkins, 
and Leibniz in communication systems involved various forms of optical 
telegraph links [1]. The first link was in France in 1794 [1]. After the birth 
of electrical technology around 1800, when Volta discovered the primary 
battery, digital communication by electrical methods began to evolve. The 
most important system was the telegraph perfected by Samuel Morse in 
1837. The Morse code was in reality a binary code where the two “letters,” 
called a dot and a dash, were transmitted as short or long electrical pulses, 
respectively. Later on, in 1875, Emile Baudot developed another code used 
today in international telegraph [2]; it uses binary digits 0 and 1 instead of 
dots and dashes and has a fixed number (five) of digits per codeword instead 
of a variable-length codeword, as in the Morse code. 

After the theoretical prediction of electromagnetic radiation in 1864 
by James Clerk Maxwell (1831-1879), a Scottish physicist, and its experimental 
verification in 1887 by Heinrich Hertz (1857-1894), a German physicist, 


Figure 1.1-2- Codewords for Wilkins two-, three-, and five-letter alphabets -f- The left-most unnecessary zeros are dropped in the Bacon code or, equivalently, 

(circa 1641). (Adapted from [1], © 1983 IEEE, with permission.) added to the Leibniz code. 
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the transmission of information by radio telegraph was later achieved by 
Marconi in 1897. 

Not all digicom systems are designed to convey the alphabet as messages. 
In 1937 Alec Reeves conceived one of the most important digital techniques 
in use today [3]. It is called pulse code modulation (PCM) and it can 
convey messages such as the audio waveform produced by a microphone. 
In PCM the message is first periodically sampled. Each sample, which can 
be any value in a continuum of possible values, is rounded or quantized 
to one of a finite set of discrete amplitudes. The amplitudes are then no 
different, conceptually, from the finite-sized alphabet or a finite set of 
integers as encoded by Bacon or Leibniz earlier. Thus the amplitudes are 
assigned codewords similar to those of Fig. 1.1-3. Finally, the PCM procedure 
assigns suitable electrical waveforms to represent the codewords. For ex- 
ample, a rectangular video pulse might be used to represent a binary 1, 
whereas no pulse might correspond to a binary 0. 

The preceding short historical sketch is by no means complete (see 
[1] for more details and other references). It does, however, serve to 
indicate that many of the basic concepts used in modem digicom systems 
are not new and have their roots in the minds of men who lived over 380 
years ago. It also has served to highlight some of the concepts that remain 
crucial to modem systems, such as sampling, quantization, and coding. 

1.2 SOME DEFINITIONS 

Prior to development of details on digicom systems, it is helpful to define 
some quantities that relate to all succeeding work. 

Analog Sources and Signals 

An analog source of information produces an output that can have 
any one of a continuum of possible values at any given’ time. Similarly, 
an analog signal is an electrical waveform that can have any one of a 
continuum of possible amplitudes at any one time. The sound pressure 
from an orchestra playing music is an example of an analog source, whereas 
the voltage from a microphone responding to the sound waves represents 
an analog signal. We shall often refer to an analog signal simply as a 
message. 

Even though the principal thrust of this book is the description of 
digicom systems, we shall also demonstrate in detail how the digital systems 
can be used to convey analog messages. 

In some developments it is helpful to model a message as a deterministic 
waveform (Appendix A), whereas in others we model it as a sample function 
of a random process (Appendix B). Appendixes A and B are included for 
those readers desiring to review waveform representations. 


Sec. 1.2 Some Definitions 

Digital Sourcas and Signals 

We define a digital source as one with an output that can have only 
one of a finite set of discrete values at any given time. Most sources in 
nature are analog. However, when these are combined with some manu- 
factured device, a digital source may result. For example, temperature is 
an analog quantity but when combined with a thermostat with output values 
of on or off, the combination can be considered a digital source. A digital 
signal is defined as an electrical waveform having one of a finite set of 
possible amplitudes at any time. If the thermostat is designed to output a 
voltage when on and no voltage when off, its output is a digital signal, also 
referred to as a message. 

In general, we shall be a bit broad in our use of the word message. 
It will also be frequently used to refer to source outputs. Thus a digital 
source can be said to issue one of a finite set of messages at any one time. 

Signal Classifications 

Analog and digital information signals defined above are assumed to 
be baseband unless otherwise defined. A baseband waveform is one having 
its largest spectral components clustered in a band of frequencies at (or 
near) zero frequency. The term lowpass is often used to mean the same 
as baseband. All practical lowpass messages will have a frequency above 
which their spectral components may be considered negligible. We shall 
call this frequency, denoted by W f for a message labeled /(f), the spectral 
extent of the message (the unit is radians/second). 

A bandpass signal is one with its largest spectral components clustered 
in a band of frequencies removed by a significant amount from zero. Most 
information waveforms are baseband; they are rarely bandpass. The bandpass 
signal is usually the result of one or more messages affecting (modulating) 
a higher frequency signal called a carrier. 

Carrier Modulation 

Most information sources are characterized by baseband information 
signals. However, baseband waveforms cannot be efficiently transmitted 
by radio methods from one point to another. On the other hand, bandpass 
waveforms may readily be transmitted by radio. One basic purpose of 
carrier modulation is, therefore, to shift the message to a higher frequency 
band for better radiation. In some systems carrier modulation can also 
result in improved performance in noise. 

Often modulation involves changing the amplitude, frequency, or phase 
(or combination of these) of a carrier as some function of the message. 
When the message is digital we often refer to these operations as keying — 
for example, phase shift keying (PSK). 
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Sec. 1.3 Digital System Block Diagram and Book Survey 


1 .3 DIGITAL SYSTEM BLOCK DIAGRAM AND BOOK 
SURVEY 

A typical communication system involves a transmitting station (sender), 
a receiving station (user), and a connecting medium called a channel. These 
basic functions are adequate for one-way operation. Two-way communication 
requires each station have both transmitter and receiver. Because operation 
each way is similar, only one-way operation need be described. 

Transmitting Station 

Of course, we are interested mainly in digital systems. A block diagram 
of the principal functions that may be present in a digicom system is illustrated 
in Fig. 1.3-1. Because the overall system is digital, the transmitting subsystem 
can accept such signals directly. It can also work with analog signals if 
they are converted to digital form in the analog-to-digital (A/D) converter. 
A/D conversion involves periodically sampling the analog wave- 
form (to be discussed in Chap. 2) and quantizing the samples. Quantiza- 
tion amounts to rounding samples to the nearest of a number of discrete 
amplitudes. A digital signal results that is compatible with the digital system. 
However, in the rounding process information is lost that limits the accuracy 
with which the analog signal can be reconstructed in the receiver. Recon- 
struction methods are discussed in Chap. 2 while quantization principles 
are developed in Chap. 3. 


Transmitting Subsystem 


Digital 


Analog 

Message 


Digital 

Message 


Analog 

Output 


Digital 

Output 



Receiving Subsystem 


As described earlier, the actual output of the A/D converter at point 
A in Fig. 1.3-1 is a discrete voltage level. For purposes of describing the 
next function, the source encoder, it is helpful simply to imagine the digital 
signal, regardless of its origin, to be one of a finite set of discrete symbols 
(levels) at any given time. The general purpose of the source encoder is 
to convert effectively each discrete symbol into a suitable digital repre- 
sentation, often binary. For example, let a digital message have five possible 
symbols (levels), denoted by m u m 2 , m 3 , m 4 , and m 5 . These levels can 
be represented by a sequence of binary digits 0 and 1, as shown in the 
middle column of Table 1.3-1. Each sequence has three digits; each digit 
in the sequence can be 0 or 1. Clearly, there are 2 3 = 8 possible binary 
codewords,! so that a three-digit binary representation can handle as many 
as eight symbols. For comparison, a ternary representation using a two- 
digit sequence is shown in the third column; here each digit can be a 0, a 
1, or a 2. There are 3 2 = 9 possible ternary codewords, so as many as 
nine symbols could be represented by this ternary code. 


TABLE 1.3-1. Digital Representations for a Five- 
Symbol Message Source. 


Symbols (levels) 
Available 

Symbol 

Representation 

Natural Binary 

Ternary 

m, 

000 

00 

m 2 

001 

01 

m. 

010 

02 

m 4 

011 

10 

m 5 

100 

11 


Digital messages are said to possess redundancy if their symbols are 
not equally probable or are not statistically independent [4]. Most practical 
message sources — the English language, for example — have redundancy; 
otherwise, y cld nt undrstnd ths sentce. A principal purpose of the source 
encoder is to remove redundancy. The more effective the encoder is, the 
more redundancy is removed, which allows a smaller average number of 
binary digits to be used in representing the message. Source encoding is 
developed in Chap. 3. 

In some systems where no channel encoding function is present, the 
source encoder output is converted directly to a suitable waveform (within 
the modulation function) for transmission over the channel. Noise and 
interference added to the waveform cause the receiver’s demodulation op- 
eration to make errors in its effort to recover (determine) the correct digital 


Figure 1.3-1. Functional block diagram of a communication system. 


t The code, which is the collection of all possible codewords, has length 3 and size 8. 
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Sec. 1.4 Simple Baseband System for Reference 


representation used in the transmitter. By including the channel encoding 
function, the effects of channel-caused errors can be reduced. The channel 
encoder makes this reduction possible by adding controlled redundancy to 
the source encoder’s digital representation in a known manner such that 
errors may be reduced. Channel encoding is discussed in Chap. 3. 

Channel 

We shall, throughout the book, assume the channel is linear and time 
invariant. Unless otherwise defined, it is considered to have infinite bandwidth 
with added white Gaussian channel noise of constant power density spectrum 
W 0 /2 applicable on -oo < w < oo. Thus we study only the additive white 
Gaussian noise (AWGN) channel. 

Receiving Station 

The functions performed in the receiving subsystem merely reflect the 
inverse operations of those in the transmitting station. The demodulator 
recovers the best possible version of the output that was produced by the 
channel encoder at the transmitter. The demodulator’s output will contain 
occasional errors caused by channel noise. Part of the optimization of 
various digital systems centers on minimizing errors made in the demodulator. 
Optimum baseband systems are developed in Chaps. 4 and 6, and optimum 
bandpass systems are discussed in Chaps. 5 and 6. 

The purpose of the channel decoder is to reconstruct, to the best 
extent possible, the output that was generated by the source encoder at 
the transmitter. It is here that the controlled redundancy inserted by the 
channel encoder may be used to identify and correct some channel-caused 
errors in the demodulator’s output. Channel decoding is considered in 
Chap. 3. 

The source decoder performs the exact inverse of the source encoding 
function. For digital messages its output becomes the final receiver output 
(point B in Fig. 1.3-1). If the original message was analog, the source 
decoder output is passed through a digital-to-analog (D/A) converter which 
reconstructs the original message using the sampling theory described in 
Chap. 2. 

The reader is cautioned to accept the above discussions with a degree 
of open-mindedness. Remember that the discussions center around the 
functions that may be present in a digicom system and do not always infer 
the actual implementation of a system. Hence practical systems may not 
always follow the blocks in Fig. 1.3-1 exactly. For example, a number of 
digital systems are developed in Chap. 4 that can each accept an analog 
message directly and produce the channel waveform in one operation. 
These systems typically use no channel encoding or decoding operations. 

In the course of this book the various functions of Fig. 1.3-1 are 


discussed in detail. In those digital systems that use analog messages 
through application of A/D and D/A methods, it is helpful to define a 
simple system against which comparisons of noise performance can be 
made. 


1.4 SIMPLE BASEBAND SYSTEM FOR REFERENCE 


The simplest possible analog system is shown in Fig. 1.4-1. A message 
f{t), with spectral extent W f , is transmitted directly to a receiver through 
a channel where white noise of power density JTJ2 is added. The receiver 
is just a lowpass filter (LPF) with bandwidth W f . The filter passes the 
message with negligible distortion but is no wider in bandwidth than necessary 
in order not to pass excessive noise. If the filter is approximated by an 
ideal rectangular passband with unity gain, the output signal and noise 
powers aret 

So = &f)=fHt) (1.4-1) 


x, i f~„ , , J l r% , df 0 Wf 
K * S J d “ • T, J...T ' if' 


- Wf r. 2 it 

where .ffco) is the power density spectrum of the output noise njt). 


(1.4-2) 



Output 


Signal s d (t) 
Noise n d (t) 


Figure 1.4-1. Simple baseband analog communication system useful in comparing 
noise performances of other systems [J], 

At the input to the filter, signal power 5, is 

5,- =A0 = 5 o . (1.4-3) 

We define input noise power TV, as the noise power in the band of the filter. 
Thus 

Ni = N a = (1.4-4) 

2 IT 

In systems using analog messages, an excellent measure of noise 
performance is the output signal-to-noise power ratio. For our elementary 


t The overbar represents either the infinite-time average for a deterministic signal or 
the statistical expected (mean) value for a random message (see Appendix B). 
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baseband system we have 

So) _ /• Sj\ _ 2n 7(f) _ 2tr5,. 

nJ, U/a ' W 7V 0 W/ 

These signal-to-noise ratios provide a good basis against which the per- 
formance of other, more complicated, systems can be compared. 
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Chapter 2 

Sampling 

Principles 


2.0 INTRODUCTION 

When an analog message is conveyed over an analog communication system, 
the full message is typically used at all times. To send the same analog 
signal over a digicom system requires that only its samples be transmitted 
at periodic intervals. Because the receiver can, therefore, receive only 
samples of the message, it must attempt to reconstruct the original message 
at all times from only its samples. Methods exist whereby this desired end 
can be accomplished. These methods involve the theory of sampling which 
is described in this chapter. For the most part we shall need only the 
theory related to lowpass waveforms (Secs. 2. 1-2.5) to continue profitably 
through the book. However, there are several other interesting topics in 
sampling theory that are also included for those readers wishing to delve 
deeper into the subject. 

At first glance it may seem astonishing that only samples of a message 
and not the entire waveform can adequately describe all the information 
in a signal. However, as remarkable as it may seem, we shall find that 
under some reasonable conditions, a message can be recovered exactly 
from its samples, even at times in between the samples. To accomplish 
this purpose we shall introduce some sampling principles (theorems) that 
apply to either deterministic or random signals (noise). These principles 
form the very foundation that makes possible most of the digital systems 
to be studied in the following chapters. A good review of the literature on 
sampling theory has been given by Jerri [1], 
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Sec. 2.1 


Sampling Theorems for Lowpass Nonrandom Signals 


One of the richest rewards to be realized from sampling is that it 
becomes possible to interlace samples from many different information 
signals in time. Thus we have a process of time-division multiplexing anal- 
ogous to frequency multiplexing. With the availability of modem high- 
speed switching circuits, the practicality of time multiplexing is well established 
and, in many cases, may be preferred over frequency multiplexing. 


2.1 SAMPLING THEOREMS FOR LOWPASS NONRANDOM 

SIGNALS 

In this section we shall discuss two sampling theorems. First, we consider 
sampling a nonrandom lowpass waveform. After developing the applicable 
sampling theorem, we show how the original unsampled waveform can be 
recovered from its samples. A simple way of interpreting the sampling 
theorem using networks is introduced to show this fact. The second theorem 
is a sort of dual to the first; we show that the spectrum of a time-limited 
waveform can be determined completely from samples of its spectrum. 

Time Domain Sampling Theorem 

The lowpass sampling theorem may be stated as follows: 

A lowpass signal /(/), bandlimited such that it has no frequency components 
above W f rad/s, is uniquely determined by its values at equally spaced 
points in time separated by T s « ir/W f seconds. t 

This theorem is sometimes called the uniform sampling theorem owing to 
the equally spaced nature of the instantaneously taken samples. t It allows 
us to completely reconstruct a bandlimited signal from instantaneous samples 
taken at a rate co, = 2ir/T s of at least 2 W f , which is twice the highest 
frequency present in the waveform. The minimum rate 2W f is called the 
Nyquist rate. 

The sampling theorem has been known to mathematicians since at 
least 19 15 [4]. Its use by engineers stems mainly from the work of Shannon 
[5]. Proof of the theorem begins by assuming /(/) to be an arbitrary waveform, 
except that its Fourier transform F(o>) exists and is bandlimited such that 
its nonzero values exist only for - W f =£ w « W f . Such a signal is illustrated 
in Fig. 2. l-l(a). From the theory of Fourier series, the spectrum may be 
represented by a Fourier series developed by assuming a periodic spectrum 
Q(a>) as shown in (b). If the “period” of the repetition is s 2 W f , it is 
clear that no overlap of spectral components will occur. With no overlap 

t If a spectral impulse exists at u> = ± W f , the equality is to be excluded [2]. 

4 The theorem can be generalized to nonuniform samples taken one per interval anywhere 
within contiguous intervals T < n/W f in duration [3], 




Figure 2.1-1. Waveforms and spectrums related to sampling, (a) A signal and 
its spectrum, (b) periodic representation for spectrum of (a), and (c) the signal 
reconstructed from its samples [6]. 


me rourier senes giving Q(w) will also equal F(co) for |tu| 
Q(co) = F(w), |o>| « W f . 

By using the complex form of the Fourier series we get 


Q((o) = 2 C/" 2 ™/"', 


vY f . inus 
( 2 . 1 - 1 ) 


( 2 . 1 - 2 ) 


where the series coefficients are given by 

“W 2 

■ ■ 

c„ 




Q{a>)e 


■jn2ir<i}/(os 


do) 


(2.1-3) 


for n - 0, ±1, ±2, ... . Since in the central period Q(w) = F( to), the 
coefficients evaluate to 

-Wf 


. = 2tt_l r 

oj s 2ir J _ v 


2 n J - n2ir 


(2.1-4) 


F{o))e~ in2 ^- do) = —f[ 

-Wf Q)/\ 

We see that the coefficients are proportional to instantaneous samples of 
/(f) at times — n2v/o> s = —nT s , where T, is the time between samples: 

T s = 2n/o) s . (2.1-5) 


U l in I JJHNiRIHHIlHMNi 
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From (2.1-2) the periodic spectrum Qiai) becomes 

G(co) = 2 — 

oi, \ oi, 


(2.1-6) 


which is valid for all ai. Next, we recognize that fit) must be given by the 
inverse Fourier transform of its spectrum, as given by (2.1-1) with (2.1-6) 
substituted. After inverse transformation we have 


£ — 7 (— 

oi, \ 0) s 


■H-f)] 


WAt- 


(2.1-7) 


where we have let it = — n. This equation constitutes a proof of the uniform 
sampling theorem, since it shows that fit) is known for all time from a 
knowledge of its samples. The samples determine the amplitudes of time 
signals in the sum having known form. 

The time signals all have the form sin(x)/x. This form is defined in 
Appendix A as the sampling function Sa(x) — that is, 

/I i o\ 


Sa« = 


( 2 . 1 - 8 ) 


The sampling function is sometimes called an interpolating function. In 
terms of sampling functions, (2.1-7) can be restated as 

w;r °° 

fit) = ^ 2 fikT s )Sa[W f it - kT s )). (2.1-9) 

TT _oo 

Recall that these results were derived under the restriction oi, 

2 W f , requiring a sampling interval 

T,*£. (2.1-10) 


as stated in the theorem. An important special case of (2.1-9) occurs when 
sampling is at the maximum (Nyquist) period T, = tr/Wf 


fit) = 2 fikT s ) SatW/r - kT,)}. 


(2.1-11) 


Figure 2. l-l(c) illustrates a possible signal fit) as being the sum of time- 
shifted sampling functions as given by (2.1-11). 

Two other valuable forms of the sampling theorem as embodied in 
equation form are given without proofs: 
wt ” 

fit - f„) = 2 fikT, - to)Sz[W f it - kT,)] (2.1-12) 


^ 2 MT, - t 0 )Sa[W f it + t 0 - kT,)]. (2.1-13) 

rr fc- -oo 
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The reader may wish to prove (2.1-12) as an exercise. The procedure 
is to retrace the above proof, except replace F(co) by the spectrum 
F(a>)exp( -jait 0 ) corresponding to the delayed signal fit - t 0 ). As a second 
exercise, (2.1-13) follows from (2.1-12) by inspection. (How?) 

Our main results are (2.1-9), (2.1-12), and (2.1-13); these are all valid 
for complex as well as real signals. Waveforms are assumed to be nonrandom, 
however. We develop a sampling theorem for random signals in Sec. 2.2. 

Example 2.1-1 

As an example relating to sampling functions we show that 

r 

U 1 To ^ 0, mfk 

Sa [T*^ ~ kTs A Sa |^ ~ mT ^ dt = 2n 

oi,’ m ~ k ' 

The sampling functions are said to be orthogonal because they satisfy this relationship. 
We shall use Parseval’s theorem, which may be written in the form 

J x(t)y*it) dt = 2 . J Xioi)Y*iai) day 



for two signals x{t) and yit) having Fourier transforms A(m) and Yiiu), respectively. 
Here we use the asterisk to represent the complex conjugate operation. 

Since 



M < oi ,/ 2 
M > 0iJ2, 


where the double-ended arrow represents a Fourier transform pair, we obtain the 
desired result by integration in the frequency domain. 


_f Sa^y (t - kT,) Sa y it - mT,) dt 

= J_ f “‘ /2 feY e-^-^-dai 

2ir J-„, n \oi,/ 


2tt sin[(/n - k)tr] _ jO, m f k 

oi, (m - k)TT yitr/oi,, m = k. 


A Network View of Sampling Theorem 

There is a useful way in which lowpass sampling theorems may be 
viewed by using networks [7], Consider the network of Fig. 2.1-2. Imagine 
that a periodic train of impulses is available. We form the product of fit) 
and this pulse train to get 

Lit) = fit) £ Sit - kT,) = £ fikTMt - kT s ). (2.1-14) 

A«-oo - OO 
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View These | View 

Operations as-« \ ►- the Filter as 

a Transmitter I a Receiver 

I 

I 

I 



U/)= 5 S(t “ kT t ) 

r k ■ 


Figure 2.1-2. Block diagram of a network useful in the interpretation of sampling 
theorems [6], 


The product function using impulses is called an ideal sampler and /(f) is 
the ideally sampled version of /(f). 

Next, assume the filter is ideal with a transfer function 


«<“> - {J: 


|tu| < W f 
|cu| > W f . 


(2.1-15) 


Its impulse response, by inverse Fourier transformation of (2.1-15), is 


h(t) = —f Sa(W>f). 

77 


(2.1-16) 


The filter output response to (2.1-14) now follows easily: 


w 

So(t) = ~ r 2 m 5 ) Sa[W}(f - kT s )] 

w (2.1-17) 

m m 

T s ' 


The last form in (2.1-17) derives from the sampling theorem, which means 
that /(f) must be given by (2.1-9). 

Let us pause to summarize what has been developed. First, a train 
of instantaneous samples of /(/) at the product device output has been 
generated. The sample rate is co s = 2v/T s 3= 2W f . These samples may be 
viewed as the output of a transmitter. Second, by use of an ideal lowpass 
filter having a bandwidth equal to the maximum frequency extent W f of 
/(f), the output is given by the middle term of (2.1-17). The filter may be 
viewed as the receiver which must recover /(f). Finally, by application of 
the sampling theorem, this output equals f{t)/T s . Thus we have shown 
that, within a constant factor, /(f) is completely reconstructed from its 
samples by using an ideal lowpass filter. Reconstruction is valid for any 
sample rate w s & 2 W f . 
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The ideal sampler of Fig. 2.1-2 cannot be realized; however, it can 
be approximated in practice by using a train of very narrow, large amplitude 
pulses. Fortunately, such measures are not usually necessary, since easily 
realizable practical techniques exist for sampling, as noted in Sec. 2.3. 

Aliasing 

Let us examine the spectrum of the ideally sampled signal /(f) of 
(2.1-14). The train of impulses can be replaced by its Fourier series rep- 
resentation to obtain 

fsO) = fit) Z 8(f - kT s ) = y Z /(f)e yw (2.1-18) 
The Fourier transform of /(f), denoted by F s ( a>), is 

F »(®) = / 2 F (w - k(0 s ) (2.1-19) 

1 s k= -oo 

from the frequency shifting property of transforms. F,(o>) is comprised of 
scaled (by 1//) replicas of the message spectrum at all multiples of oj s . If 
F(ai) is bandlimited to W f and if w, s* 2W f , these replicas will not overlap, 
which is required if the filter in Fig. 2.1-2 is to pass an undistorted spectrum 
F(co), the component of F s ( o>) for k = 0. 

If, however, /(f) is not bandlimited or if the sampling rate o> s is not 
high enough, there can be overlap of spectral components, as illustrated 
m Fig. 2.1-3. Spectral overlap in the central replica (k = 0 component) is 
called aliasing. In Fig. 2.1-3(a) aliasing is due to the message not being 
bandlimited. This form of aliasing can be minimized by sampling at a high 
enough rate that replicas become far separated. Another solution might be 
to prefilter the message to force it to be more bandlimited. In (b) aliasing 
is due only to sampling at too low a rate; the solution is to raise o> s . Spectral 
folding, as aliasing is sometimes called, causes higher frequencies to show 
up as lower frequencies in the recovered message; in voice transmission 
intelligibility can be seriously degraded. 

The mean-squared aliasing error, denoted here by E al , can be defined 
as the energy folded into the signal’s band |w| =£ &> s /2 by the shifted replicas 
[8], If <o s is at least large enough that this energy is mainly due to the 
adjacent replicas at ±co s , we havet 

^ j |L’M| 2 dto. (2.1-20) 

The ratio of E al to the undistorted signal’s total energy, E f , will be called 

t If the message is a power signal, the energy density spectrum |F(w)| 2 is replaced by 
the applicable power density spectrum of f(t). 
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- 00 , 01 , 

(b) 


Figure 2.1-3. (a) Sampled signal spectral overlap due to message not being 
bandlimited and (b) spectral overlap caused by undersampling of a bandlimited 
message [6]. 


the fractional aliasing error 



The reciprocal, E f /E aU is called the signal-to-distortion ratio by Gagliardi 
[8]. Other definitions of fractional aliasing error exist. Stremler uses one 
in which higher frequencies are weighted more heavily than lower ones [9] . 
In general, (2.1-21) and other definitions serve only as reasonable measures 
of the intensity of aliasing because the aliasing effects are difficult to measure, 
they depend on the phases of the overlapping spectral components as well 
as their amplitudes. 

Frequency-Domain Sampling Theorem 

A frequency-domain sampling theorem that is analogous to the time- 
domain theorem may also be stated: 
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A Fourier spectrum F(a>), corresponding to a signal f{t), timelimited such 
that it is nonzero only in the interval - 7} s t =s T f , is uniquely determined 
by its values at equally spaced points in frequency separated by an amount 
W s =s tt/ 7} rad/s. 

By analogy, W s and 7} here correspond to T, and Wf, respectively, in the 
time-domain theorem. The result analogous to (2.1-9) may be found to be 
TW °° 

F(a>) = -^ X F(kW s ) Sa[7}(ru - kWJ) (2.1-22) 

v 

for real or complex /(f). This expression can also be proved by repeating 
the procedure that leads to (2.1-9). Similarly, the forms analogous to 
(2.1-12) and (2.1-13) are: 

TW °° 

F(w - o> 0 ) = -t-i X - w,))Sa[7}(co - kWj] (2.1-23) 

77 k= —oo 

TW °° 

F(a>) = X F(kW s - u> 0 )Sa[2}(o> + o> 0 - kW,)]. (2.1-24) 

Aliasing can also occur with frequency-domain sampling. Now, how- 
ever, replicas are in the time domain and are located at multiples of T s = 
2tt/W s . 

2.2 SAMPLING THEOREM 

FOR LOWPASS RANDOM SIGNALS 

The theory of sampling can also be extended to include random signals or 
noise [1, 10-16]. Let a lowpass random signal or noise be represented as 
a sample function of a random process N(t). We assume N(t) to be wide- 
sense stationary with a power density spectrum y N (a>) bandlimited such that 
y N (co) = 0, |ca| > W N , (2.2-1) 

where W N is the maximum spectral extent of N(t). Because ^(co) is band- 
limited and the noise’s autocorrelation function is the inverse transform of 
S^v(oj), the autocorrelation function must have representations of the forms 
of (2.1-9), (2.1-12), and (2.1-13). They are 

W T °° 

*i»(t)-— X R N (kT s )Sa[W N (T - kTf] (2.2-2) 

77 k= -oo 

W T °° 

Rn(t - to) = — — X R N(kT s - hSafWhr - kT s )] (2.2-3) 

tf 

W T °° 

Rv(t) = X RrAkTi - fo)Sa[W N (r + r 0 - kTf], (2.2-4) 

Tf *--~ 

where R n (t) is the autocorrelation function, T, is the period between samples 
of R n (t), and t 0 is an arbitrary constant. 
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We shall now show that N(t) can be represented by the function 

N(t) = ^ £ N(kT,)Sa[W N (t - kT s )] (2.2-5) 

TT -c» 

in the sense that the mean-squared difference (or error) between the actual 
process N(t) and its representation N{t) is zero. In other words, N(t) = 
N(t) in the sense of zero mean-squared error defined by 


e 2 = E{[N(t) - N(t)] 2 } = 0, (2.2-6) 

where £{•} represents the statistical average. By direct expansion (2.2-6) 
becomes 

? = R n (0) - 2E[N(t)N(t)] + E[N\t)], (2.2-7) 

which we show equals zero by finding the various terms. 

By substitution of (2.2-5) into the middle right-side term of (2.2-7) we 
develop the following: 

E[N(t)N(t)] = X E[N(t)N(kT,)]Sa[W N (t - kT,)} 

7T (2.2-8) 

= ^ £ R N (kT, - tmW N (t - kT s )] = R n { 0). 

TT ic*. _oo 

The last form of (2.2-8) is the result of applying (2.2-4) with r = 0 and 
r 0 = t. At this point (2.2-7) becomes 

? = -MO) + E[N\t) ]. (2.2-9) 

By again using (2.2-5) 

£(N 2 «] = 4— £ N(kT,mW N (t - kT,)] 

7T t*-oo 

• ^eL £ N(/T,)Sa[W„(/ - /Tj)]| 

7T J (2.2-10) 


= ^ Sa[W„(/ - kT s )] 

IT _oo 


• ^ £ « w (/r, - itr s )Sa[w N (f - IT,)]. 

TT 

From (2.2-3) with r = t and / 0 = kT„ the second sum in (2.2-10) is 
recognized as R N (t - kT s ) = R^ikT, - t), so 


E[ N 2 (/)]=— £ ~ <)Sa[W N (t - WJ] = M0). (2.2-11) 

TT /k— — oo 

Here we used (2.2-4) again with r = 0 and to = t to obtain the last form 
of (2.2-11). Finally, ~P becomes zero when (2.2-11) is used in (2.2-9). 
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In summary, the above development has shown the following sampling 
theorem to be valid: 

A lowpass wide-sense stationary random process N(t), bandlimited such 
that its power density has no nonzero frequency components above W N 
rad/s or below — W N rad/s, can be represented by its sample values at 
equally spaced times separated by T s s n /W N s; the representation is N(t) 
of (2.2-5), which is valid in the sense that N(t) = N(t) with zero mean- 
squared error. 

Much of our earlier results applicable to nonrandom waveform sampling 
also apply to random signals. For example, the network interpretation of 
the lowpass sampling theorem shown in Fig. 2.1-2 applies. When the random 
process N(t) is ideally sampled in the product device, the sampled version 
A hif) is 

N,(t) = £ N(kT s Mt ~ kT,). (2.2-12) 

Each impulse excites the filter and contributes an output component N(kT s ) 
times the impulse response (Ath impulse). Since the impulse response is 
(2.1-16), we obtain an output random signal N 0 (t) of 

NJLt) = — £ N(kT,)Sa[W N (t - kT,)] = ^ (2.2-13) 

rr 1, 

from (2.2-5), if the filter’s bandwidth is W N . 

Example 2.2-1 

A signal is added to noise; the sum is to be transmitted by ideal sampling. The 
receiver consists of an ideal filter that acts on the ideally sampled sum. We take 
two examples, both of which assume the receiver must recover the signal without 
distortion. 

First, suppose the noise is bandlimited to W N /2n = 2 kHz, whereas the signal 
is bandlimited to W f /2Tr = 3 kHz. For no signal distortion, a sample rate of at 
least 6 kHz is required. Since this rate exceeds 2W n /2it - 4 kHz, the noise will 
also be reconstructed without error if the receiver bandwidth is 3 kHz. 

Second, suppose the bandwidths are reversed. Signal bandwidth is now 2 kHz 
and noise bandwidth is 3 kHz. A sample rate of 4 kHz with a receiver bandwidth 
of 2 kHz leads to perfect message recovery, but the noise is undersampled which 
produces aliasing. Some power-spectrum sketches will show that the total receiver 
output noise power is the same as that transmitted if sampling and receiver matched 
the noise’s Nyquist rate, but the form of the power spectrum of the output noise 
has changed. 
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2.3 PRACTICAL SAMPLING METHODS 

The instantaneous sampling of the preceding sections using impulses has 
been called ideal sampling. It must be approximated in practice by using 
large, vanishingly narrow pulses (in relation to ir/Wj). On the other hand, 
it may be impractical or even undesirable to use very narrow pulses. Practical 
transmissions of interest will then use finite duration pulses. In the following 
paragraphs, we investigate two forms of such practical sampling. In Sec. 

2.4 the techniques are generalized. 

Natural Sampling 

Natural sampling involves a direct product of fit) and a train of 
rectangular pulses, as shown in Fig. 2.3- 1(a). The spectrum of fit) is defined 
as F(tc); it and fit) are sketched in (b). The pulse train s p (t) has amplitude 
K and has the Fourier series expansion 

, , Nt sin(£a),r/2) _ ,^ u . t \ 


, _ Kt 

' t, 


(2.3-1) 


with r being the pulse length and w s = 2tt/T s the pulse rate. The train and 
its spectrum 


a3 - 2> 

are illustrated in (c). The product 

fit) = s p it)fit) (2.3-3) 

is the sampled version of fit) .t 

The spectrum Ffco) of fit) is helpful in visualizing how /(f) is recovered 
from its samples. By recalling that a time product of two waveforms has 
a spectrum given by 1/2 tt times the convolution of the two spectrums, we 
obtain 


siniktDj/l ) , 


(2.3-3) 


F s ico) = — F(oj) * Sfoj) 


kwJFioj - x) dx (2.3-4) 


= ~ £ ^ikmj/i) r S(x _ ko))F(a) _ x) dx (2 3 . 4) 

T s km — oo k(o s r/2 J -o o 

Kt v sin(fco)jT/2) 

= T L -f — TT~ F(w “ k( °J- 
T s kco s r/2 

The product and its spectrum are shown in Fig. 2.3-l(d). 

From (2.3-4) it is seen that, so long as o) s 2 W f , the spectrum of the 
sampled signal contains nonoverlapping, scaled, and frequency-shifted replicas 
of the information signal spectrum. By applying fit) to a lowpass filter of 

t There is an implicit constant involved in the product device having the value 1.0 V"'. 
The purpose of the constant is to preserve units after the product. 
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Figure 2.3-1. (a) Method of natural sampling for waveform of (b) using the pulse 
train of (c) to produce the signal and spectrum of (d). The signal is recovered 
with the lowpass filter of (e) [6], 
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bandwidth W f , as shown in Fig. 2.3-l(e), /(f) is easily recovered without 
distortion.t The spectrum S„(co) of the output signal s 0 (t) from the filter 
will be 

S» = ^F(co), (2.3-5) 

* s 

while the output signal is 

sM) = (2 ' 3 ' 6) 

The foregoing discussion has shown that the product of a message 
/(f) and a realizable train of rectangular, finite-amplitude pulses forms a 
realistic sampling method, that of natural sampling. The spectrum of the 
naturally sampled version /,(f) of /(f) contains undistorted replicas of the 
message spectrum F(co). The central term is just an amplitude-scaled version 
of F(co) that results in reconstruction of /(f) when selected by a lowpass 
filter. From (2.3-6) the filter’s output is seen to be /(f) scaled by the dc 
component, Kt/T s , of the sampling pulse train s„(t). 

The network model of the ideal sampler of Secs. 2.1 and 2.2 hold true 
for natural sampling. This fact follows because the product device [Fig. 
2.3-l(a)] constitutes a practical transmitter of samples of /(f) while the 
receiver consists of an ideal lowpass filter to recover the message. In fact, 
if the sample rate exceeds the Nyquist rate (co, > 2 W f ), there is a “clear” 
region from |eu| = W f to |co| = w s - W f over which a more realistic filter 
can go from full response to negligible response, as noted earlier. 

Flat-Top Sampling 

With this type of sampling the amplitude of each pulse in a pulse train 
is constant during the pulse but is determined by an instantaneous sample 
of /(f) as illustrated in Fig. 2.3-2(a). The time of the instantaneous sample 
is chosen to occur at the pulse center for convenience. t For comparison, 
and to illustrate the differences involved, natural sampling is illustrated 
in (b). 

For the assumed ideal rectangular pulses, the flat-top sampled signal 
is 

m = t £ f{kT s ) K reel ( L= ^) - (2-3-7) 

t If o), > 2 W f , the filter must uniformly pass all frequencies |a>| s W f but may then roll 
off in a practical way out to the point |<u| = gj, — W f , where its response must become 
negligible (ideally zero). As oi s approaches 2 WV , the filter must approach the ideal lowpass 
filter. 

f In a realizable system the sample time must occur at or before the leading edge of 
any given sample pulse, since it is impossible to initiate a pulse of the required amplitude 
before that amplitude is established. 
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Sample Values 
of AT/(f) 


(a) 




(b) 

Figure 2.3-2. (a) Flat-top sampling of a signal /(c) and (b) natural sampling [6]. 


where K is a scale constant; it is the amplitude of a sampling pulse for a 
unit input (dc) signal. The function rect (•) is defined by (A.2-1) of Appen- 
dix A. 

To determine our ability to reconstruct /(f) from the sampled signal 
(2.3-7), it is helpful to observe that the same expression derives from an 
ideal sampler followed by an amplifier with gain Kt and a filter. The filter 
must have an impulse response 


q{t) = - rect( - 

T \T 


and transfer function 


Q( w) = 


sin(cor/2) 


(2.3-8) 


(2.3-9) 


' cut/ 2 ' K ’ 

The network is illustrated in Fig. 2.3-3(a).t It is straightforward to show 
that the spectrum of f s (t) is 


n-/ , Kt v sin(cur/2) N 

F,(cu) = — 2j J2 , — F(w - ka> s ). 

T s k= Off/2 


(2.3-10) 


F s (<i>) of (2.3-10) appears on the surface to be similar to (2.3-4) for 


t This model is only for mathematical and conceptual convenience. Actual implementation 
in practice may be quite different. The amplifier is implied simply so that Q(w) as defined 
may have a low-frequency gain of unity. The reasoning will become clearer as the reader 
proceeds to subsequent developments. 
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-Wf aw, « 

(c) 


figure 2.3-3. (a) Generation method for flat-top samples, (b) Signal recovery 
method requiring the equalization filter with the transfer function of (c) [6]. 

natural sampling. There is an important difference, however. A lowpass 
filter operating on (2.3-10) will not give a distortion-free output proportional 
to /(r). To see this, suppose a lowpass filter alone were used. The output 
spectrum would be (KT/T s )F{co)sm(<oT /2)/(un/2), which is clearly not pro- 
portional to F(w) as needed. The factor Qi<o) = sin(wr/2)/(<W2) represents 
distortion which may be corrected by adding a second filter, called an 
equalizing filter. It must have a transfer function H eq i<o) = 1 /G(*>) for 
|to| =£ W f , that is, 

jw| ^ W f 

//<»= Sin(wT/2) ( 2 - 3 - H) 
[arbitrary elsewhere. 

The equalized output spectrum is 

sM = j-m. 


(2.3-12) 
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and the output signal is 

sJLO = f/W- (2.3-13) 

5 

Our analysis has shown, in summary, that flat-top sampling still allows 
distortion-free reconstruction of the information signal from its samples as 
long as a proper equalization filter is added to the reconstruction (lowpass) 
filter path. These operations are given in block diagram form in Fig. 
2.3-3(b). The equalizing filter transfer function is shown in (c). As in natural 
sampling we again find that the output s„(t) is proportional to fit) with a 
proportionality constant equal to the dc component of the sampling pulse 
train. In the next section this fact will again be evident even when arbitrarily 
shaped sampling pulses are used. 

2.4 PRACTICAL SAMPLING 

WITH ARBITRARY PULSE SHAPES 

In the real world pulses are never perfectly rectangular as was assumed 
above for natural and flat-top sampling. To have such pulses would require 
infinite bandwidth in all circuits, an obviously unrealistic situation. It becomes 
appropriate to then ask: What can be done in a more realistic system? To 
answer this question, let us assume an arbitrarily shaped sampling pulse 
pit). The sampling pulse train will be a sum of such pulses occurring at 
the sampling rate (o s . The spectrum of pit) is defined as Pirn). 

Consider first a generalization of natural sampling. The sampling pulse 
train now becomes 

= Z ?('- W.), (2.4-1) 

where pit) is some arbitrary pulse shape as illustrated in Fig. 2.4-l(a). 
Expanding s p (t) into its complex Fourier series having coefficients C k , we 
develop the sampled signal as 

Lit) = fit) 2 Pit ~ kT s ) = £ CJifie**. (2.4-2) 

k.= —OO k = -03 

The spectrum of Lit) becomes 

Ff<») = 2 C k F(a) - kw s ). (2.4-3) 

— oo 

Both Lit) and Ffw) are sketched in Fig. 2.4-l(b). 

Again we see that by using a lowpass filter, the signal fit) may be 
recovered without distortion. The filter will pass the term of (2.4-3) for 
k = 0. The output is easily found to be 

Sait) = C 0 fit). 


(2.4-4) 







- Q - " ’ , -iu t -2u, -u, 0 3 “. “ 

(b) 

Figure 2.4-1. Spectrums associated with natural sampling using arbitrarily shaped 
pulses, (a) Train of arbitrary pulses and its spectrum, (b) Sampled signal and 
its spectrum [6]. 


Here C 0 is the dc component of the sampling pulse train. We may define 
parameters K and r such that C„ = Kt/T s , where K is defined as the actual 
amplitude of p(f) at t = 0. We may think of r as the duration of an 
equivalent rectangular pulse that gives the same pulse tram dc level and 
has the same amplitude at t = 0 as the actual pulse. It is easily determined 
that r is given by 


T 


j_ r 

pi 0) J -r ,/2 


pit) dt. 


(2.4-5) 


With these definitions (2.4-4) becomes 


S„lt) = 

* s 


(2.4-6) 


The analysis above has shown that the only effect of using an arbitrary 
pulse shape in the sampling pulse train, as far as signal recovery in natural 
sampling goes, is to produce a reconstructed signal scaled by a factor equal 

to the dc component of the pulse train. 

Flat-top (instantaneous) sampling may also be generalized. Following 
the points discussed above we define an arbitrary pulse shape qit), having 
a spectrum Q( o>), which is related to the arbitrary pulse p(t) by 


pit ) = Krq(t). 


(2.4-7) 
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It is a normalized version of p(t) having amplitude 1 /t at t = 0 and unit 
spectral magnitude! at w = 0. As before, K is the amplitude of pit) at 
t = 0 and r is the equivalent rectangular pulse of pit) found from (2.4-5). 
The sampled signal becomes 


fM = 2 f{kT s )Kjq{t - kT s ). 


This is a generalization of (2.3-7). We now recognize that the block diagram 
of Fig. 2.3-3(a) applies to (2.4-8) if the filter impulse response is qit). The 
spectrum at the filter output is now found to be 

F» = ~ 2 Q(a>)F(a> - ka> s ). (2.4-9) 

* s k- -°o 

Again, in reconstruction of fit), only the term for k = 0 is of interest, 
since it is the only one passed by the lowpass filter. The output spectrum 
is KtQI<j))F(<j)) / T s , and the factor £>(&>) represents distortion. As in the flat- 
top case, we may use an equalizing filter to remove the distortion. The 
required filter transfer function is 

H eq {o>) = -J-. (2.4-10) 


The overall output with equalization becomes 

S 0 (a>) = ~ Fib)), 

* S 

Soil) = yf(». 


(2.4-11) 


In all our practical sampling methods the output of the reconstruction 
filter in the receiver is fit) scaled by the dc component of the unmodulated 
pulse train. 


Example 2.4-1 

A flat-top sampling system uses a train of triangular pulses defined by (A. 2-3) of 
Appendix A: 

Pit) = 2tri(^j «* 2t 0 Sa ! ^j = P(o>) 

where r 0 = 1 /xs. The message being sampled is assumed bandlimited to W f /2n = 
5 kHz and sampling is at the Nyquist rate. We define K, r, and qit) for this system 
and find the receiver’s output waveform. 

From the definition of K, 

K = p(0) = 2 V. 


t This is seen by calculating the Fourier series coefficient C 0 for the periodic waveforms 
using pulses of (2.4-7), substituting the Fourier transform of q(t) for a> = 0 and using (2.4-5). 
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For Nyquist sampling T, = ir/W f - 1/[2(5)10 3 ] - 10 4 s. From (2.4-5) 


- 5 C" 2 ‘iw’) “■ - L- 4 ' ■ ,o “ ■- 




Finally, from (2.4-12) the receiver’s output is 

s o 0) = = 0.02/0) ■ 

* s 


Observe that the recovered message has a relatively small amplitude (0.02 scale 
factor). In the next section, ways of increasing the output are given. 


2.5 PRACTICAL SIGNAL RECOVERY METHODS 

Practical ways of sampling messages were developed in the last two sections. 
The transmitted train of samples usually takes the form of either (2.4-8) 
for flat-top sampling or (2.4-2) for natural sampling. In both cases the 
sampling pulses could have arbitrary shape. The only message recovery 
method discussed was the lowpass filter with appropriate equalization, as 
needed. In most cases these filters produce a low level response because 
the factor t/T s common to all systems is often much less than unity. 

In this section we introduce two practical message recovery methods 
that increase the receiver’s output level compared to using a lowpass filter. 

Signal Recovery by Zero-Order Sample-Hold 

Because of the factor t/T , the output of the receiver using the lowpass 
filter reconstruction method is not as large as we might like. A simple 
sample-hold circuit may be used to increase the output by a factor TJt. 
The circuit is shown in Fig. 2.5-l(a). The amplifier gain is arbitrary and 
assumed to equal unity; it only needs to provide a very low output impedance 
when driving the capacitor. 

For purposes of description, assume the input to the sample-hold 
circuit is a flat-top signal as illustrated in (b). At the sample points (shown 
as heavy dots), the switch instantaneouslyt closes and the output level 
equals the input sample amplitude. While the switch is open the capacitor 
holds the voltage, as shown dashed, until the next closure. The output still 
looks like a flat-top sampled signal, but its pulse width is now T s instead 
of t seconds. 

t In practice it would close just after the start of a sample pulse and open just before 
it ends. 
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Impulsive 

Sampling 


(b) 

Figure 2.5-2. (a) Receiver using a sample-hold circuit for signal recovery, 
b) Equivalent receiver [6], 

The equalizer response is the final output 

s 0 (t) = Kf(t). (2.5-6) 

The above operations are illustrated in Fig. 2.5-2(a). A dc block is 
shown because some messages contain a dc component that is often not 
required (or even desired) in the output. In the present discussion the dc 
block can be ignored. From the standpoint of message reconstruction the 
overall sample-hold receiving system can be replaced by the equivalent 
system illustrated in (b). It is made up of an ideal sampler followed by an 
ideal filter having a gain T s and bandwidth W f . 

The sample-hold circuit described above is called zero-order because 
the held voltage may be described by a polynomial of order zero. 

First-Order Sample-Hold 

Figure 2.5-3 illustrates the process of first-order sampling and holding. 
At a particular instant (say kT s ), a sample of the signal is held until the 
next sample. Now, however, rather than being a constant level held, 
the level between samples varies linearly. The slope is determined by the 
present sample (at time kT s ) and the immediately past sample [at (k - 
1 ) 7 ;]. 

The output of the sample-hold circuit again has the spectrum of 
(2.5-3) where the transfer function H(a>) of the sample-hold circuit is now 

[17] 

H(u) = T,(\ + jo>T,) exp(-jo)T J ). (2.5-7) 
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Figure 2.5-3. Waveforms applicable to first-order sample-hold [6]. 


The block diagram of Fig. 2.5-2 also applies to first-order sampling and 
holding. The equalization filter transfer function to be used is 


H ',{<*) 


TJU(oj), 

arbitrary, 


|o)| W f 
elsewhere. 


(2.5-8) 


The first-order sample-hold operation derives its name from the fact that 
its held voltage is described by a polynomial of order one. 


Highsr-Order Sample-Holds 


Sample-hold operations in general may be fractional or higher integer- 
order. These are discussed in [17]. A zero-order operation is capable of 
reproducing a constant (zero-order polynomial) signal fit) perfectly. A first- 
order sample-hold operation can reproduce exactly a constant or ramp (first- 
order polynomial) signal fit). Thus, an nth-order sample-hold can reproduce 
exactly a polynomial signal of order n. Sample-hold circuits above first- 
order are rarely used in practice for various reasons including economy. 
Fractional-order data holds are sometimes preferred in control systems [17]. 


*2.6 SAMPLING THEOREMS FOR BANDPASS SIGNALS 

All our preceding discussions of sampling have dealt only with lowpass 
waveforms. In this section we consider sampling of bandpass waveforms. 
Generally, the theory involved in sampling theorems for bandpass signals 
is complicated. The complication arises from the fact that two spectral 
bands are involved in the bandpass case (one at + a> 0 and one at -&> 0 , co 0 
being the carrier frequency), as opposed to only one in the lowpass case 
(from — W f to + Wf. Sampling again produces shifted replicas of the message 
spectrum; these are now more difficult to control in order to avoid aliasing. 
Since the choice of sampling rate, (d s , is the only control available over 
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the positions of the replicas, there is less freedom in choosing values of to, 
in bandpass sampling. 

Direct Sampling of Bandpass Signals 

Recall that the lowpass sampling theorem allows a signal /(f) to have 
the expansion 

fit) = £ f{kTM‘ - m, (2 ' 6 ’ 1) 

where T, is the period between samples taken periodically at times kT„ 
f{kT ) are the samples, and hit) is a suitable function. The function hit) 
was defined by either (2.1-15) or (2.1-16) for lowpass sampling. We found 
it helpful to note that (2.6-1) could be interpreted as the response of the 
network of Fig. 2.1-2.t In direct sampling of a bandpass waveforin it is 
again possible to find a function hit) such that (2.6-1) applies [18]. Thus, 
in essence, (2.6-1) is the sampling theorem for bandpass signals and its 
form guarantees that fit) can be reconstructed from the ideally sampled 
version of /(f) using the network of Fig. 2.1-2.t 

Let fit) be a bandpass signed having spectral components only in e 
ranee W„ =£ U «= VV 0 + w r- 11 results that the Nyquist (minimum) sam- 
pling rate 2 W f can now be realized only for certain discrete values of 

(W 0 + W f )/Wf. For other ratios the minimum rate will be larger than / 

but never larger than 4W f . , riQ , 

The correct function hit) to be used in (2.6-1) is known to be [18] 

hit) = — {sin[(W„ + W f )t] - sin(Wof)}. (2.6-2) 

TTt 

With some minor trigonometric manipulation, (2.6-2) becomes 

hit) = Sa ^ cos(oj q‘) (2.6-3) 

where the carrier frequency of the bandpass waveform is defined as 

<■. - w. + \ < 2 w > 

The waveform becomes 

fit) = " 2 , f(kT s )Sa ~ — cos [w 0 (f - kTJ] (2.6-5) 

TT -cc 

from (2.6-1). 

t Network output was /M/T, for hit) defined by (2.1-16). To get exact correspondence - 
that is, get an output /(()— the filter would have to be assigned a gain of T,. 

$ The ideally sampled signal and its spectrum are again given by (2.1-14) and 
(2.1-19), respectively. 


i 
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Brown [19] recently discussed direct bandpass waveform sampling, 
which is also called first-order sampling. We shall subsequently define 
various orders of sampling. 

Quadrature Sampling of Bandpass Signals 

We continue the discussions on bandpass signal sampling by observing 
that it is not necessary that a bandpass signal be directly sampled. It is 
possible to precede sampling by preparatory processing of the waveform 
[2, 20]. Such an operation will always allow sampling at the Nyquist, or 
minimum, rate if the signal is bandlimited. 

Let 

/(f) = r(t)cos[a) 0 / + </>(0] (2.6-9) 

be a bandpass signal having all its spectral components in the band o> 0 - 
(1*7/2) « M *£ 0 )„ + 0*7/2). By direct expansion we see that f(t) can be 

represented by 

/(f) = //( f)cos(o)of) - / Q (f)sin(<o 0 r), (2.6-10) 

where /,(f) and / G (f) are the in-phase and quadrature-phase components, 
respectively, given by 

//(/) = r(/)cosM>(f)l, (2.6-11) 

/ G (f) - r(f)sin[<^(/)]. (2.6-12) 

Both of these components are bandlimited to the band M « **V 2. By a 
few simple steps it should become clear to the reader that the network of 
Fig 2.6-2(a)will generate /,(f) and / G (f). The products can be implemented 
with balanced modulators. The lowpass filters are to remove ^spec^ral com- 
ponents centered at ±2oi 0 , while passing the band -W f /2 - w - W f /l. 

Each of the signals /,(f) and/ G (f) may be sampled atarateofVV^ 
samples per second and reconstructed as shown in Fig. 2.6-2(b). By forming 
^products indicted in (b) w« may recover /W. Thus we may sample 
an arbitrary bandlimited bandpass signal at a total rate of Wy/fr sampl 
per second, using preprocessing, regardless of the ratlo °f“ 0+ ^ // ^ 
and oio - 0*7/2), and recover /(f). Notice that this is different from the 
sampling discussed before because we now have two samples being trans- 
mitted, each at a rate 1*7/ 2ir samples per second, instead of one sample 

at a rate 2W f l2ir. 

Because of the preprocessing of /(f), the components /(f) and / c ( ) 
are independent and may be independently sampled according tc .the lowpass 
sampling theorem. The two trains of sampling pulses do not have to have 
the same timing; one can be staggered relative to the other. They may e 
interlaced, forming a composite sample train at a rate o> s s* 2 **>, which 
alternately carries samples of/ ; (f) and/ e (f). In quadrature sampling a means 
must be provided in the receiver to separate the two sample trains. 
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2 cos (u 0 t) 



-sin(u„r) 

(b) 


Figure 2.6-2. Quadrature sampling of a bandpass signal /(f). (a) Quadrature 
signal generation prior to sampling and (b) waveform recovery from samples [6]. 

Bandpass Sampling Using Hilbsrt Transforms 

Another form of preprocessing prior to sampling uses Hilbert transforms 
[2, 21]. As usual, let /(f) be a bandpass signal bandlimited to W 0 *£ |o>| =£ 
W 0 + 1*7 , where its center frequencyjs w 0 = W 0 + (1*7/2), and let /(f) 
be its Hilbert transform.* The signal /(/) can be generated by passing /(f) 
through a constant phase shift of -ir/2 for > 0 and tt/2 for a> < 0. 
Samples of both /(f) and /(f) are adequate to determine /(f) [2] according 
to 

/(f) = £ IfikTJhdt ~ m + f(kT,)h 2 (t - kT s )l (2.6-13) 

oo 
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0 \w/ 

hiit) = Sa [ 2 J C0 

o W / 

h 2 (t) = -Sa -f 


COS (co 0 f) 


sin(co 0 f). 


(2.6-14) 


(2.6-15) 


The sampling period assumed in (2.6-13) is 

T (2.6-16) 

s W/ 

which corresponds to to, = W„ half the Nyquist rate. However, there are 
two functions being sampled so the total (average) sampling rate is equal 

to the Nyquist (minimum) rate. . . 

It can be shown (see Prob. 2-39) that (2.6-13) is exactly equivalent to 
quadrature sampling when samples of /,(/) and/ e (f) occur simultaneously 
and are taken at the minimum rates. 

Sampling of Bandpass Random Signals 

Because the quadrature sampling technique is a general one, and since 
it reduces the sampling representation to one of representing lowpass functions 
(in-phase and quadrature components), it can be used for random signals 
as well. We show this fact in the following development. 

We shall draw on work in Appendix B. Let N(t) be a zero-mean, 
wide-sense stationary random process bandlimited to the band 
W 0 +- W N centered at a carrier frequency co 0 = W 0 + (W N /i). nw nas 
the general representation 

N(t) = N c (t) cos(w 0 t) - N s U)sm(o) 0 t) (2.6-17) 

from (B.8-2). Here N c (t) and N s (t) are zero-mean, join^ 

processes bandlimited to |o>| « W N / 2. By applying (2.2-5) to N c (t) and N,(» 

we have 


5/(f) = N c (t)cos(ai 0 t) - Nj(r)sin(ca 0 f) 


= ^{ £ * 
ITT U*-oo 

-¥{i 


N c (kT s ) Sa 


j)Sa — 


u - m 
2 


>cos(co 0 f) 


N s (kT s + t 0 )Sa 


W„U - KT, - tp) 
2 


|sin(Gi 0 f 


(2.6-18) 

as the quadrature sampling representation of N(t). It can be shown (see 
Prob. 2-41) that NUj = N(t) with zero mean-squared error. The parameter 
to is a constant representing the shift in timing of the sa. nples of AW) 
relative to the samples of N c (t); it is given by - T s « f 0 « T, . The sampling 
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period T, in (2.6-18) must satisfy T, =£ 2tt/W n because the Nyquist rate 
per quadrature component is W N (rad/s). 

*2.7 OTHER SAMPLING THEOREMS 

Although there will be no need to invoke them in the following work, we 
shall briefly study some other sampling theorems. 

It will be helpful to recall and use the network view of the sampling 
process. For ideal sampling of a lowpass signal we found that the sampling 
theorem representation for a signal /(f) could be modeled as shown in Fig. 
2.1-2. We interpreted the filter as a reconstruction filter in the receiver 
which used instantaneous (impulsive) samples as its input. 

Even in cases where impulses were not transmitted, such as with 
natural or flat-topped pulses or the generalization to an arbitrary sample 
pulse q(t), we still found that an ideal lowpass filter was required/ The 
effect of the shape of q(t) only caused the need for an equalizing filter with 
transfer function l/£>(&>) with q(t) <-> Q( a>). These comments are incorporated 
in the network interpretation of the lowpass sampling theorem shown in 
Fig. 2.7-l(a). Since Q(co)H ei! (oj) = 1, there is no overall effect on the 


Present Only if Samples Sent 
over Channel Are Not Impulses 



U 0= Z SO-kT,) 

k m —•» 


(b) 

Figure 2.7-1. Transmission and reception of sampled signals, (a) System using 
samples having arbitrary sampling waveform and (b) equivalent system [6], 

t This fact is strictly true only if a, = 2W,. 
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recovered output due to arbitrary pulse shape, and the equivalent repre- 
sentation of (b) applies to any of these sampling methods based on instan- 
taneous samples, t 

Highor-Ordor Sampling Defined 

The lowpass sampling theorem may be summarized by writing /(f) in 
the form 

/(0 = £ f(nT,)h(‘ - *T,), (2-7-1) 

rt — — 00 

where we use the model of Fig. 2.7-l(b). The filter impulse response must 
be 

h{t) = ^ SalWyf] (2-7-2) 

7 r 

in order for (2.7-1) to equal (2.1-9). 

More generally, we may allow /(f) to be either lowpass or bandpass, 
but still a bandlimited, function. We now define #,(f), equal to the right 
side of (2.7-1), as a first-order sampling of /(f) [18]. Thus first-order sampling 
involves a single train of uniformly separated samples of /(f). By extending 
the idea we define a pth-order sampling of /(f) as 

f(t) = ^ Sii t) = 2 £ f( nT * + T /)W* ~ nT *‘ ~ T,) ' (2 ‘ 7 ' 3) 

/-I 1-1 ft- 

Here we have p functions like (2.7-1). Each has uniformly separated samples 
T s , seconds apart. Each train has a time displacement r, relative to the 

chosen origin. . a . 

The general problem in higher-order sampling is to find the n,(t) which 
make (2.7-3) valid [18]. Only certain special cases are of usual interest. 
These are: (1) first-order lowpass signal sampling, (2) second-order lowpass 
signal sampling, (3) first-order bandpass signal sampling, and (4) second- 
order bandpass signal sampling. We have already discussed (1) and (3 1 in 
the preceding work. In the following paragraphs we discuss (2) and (4). 
In addition we shall also consider a sampling theorem for lowpass sampling 
in two dimensions. 

Second-Order Sampling of Lowpass Signals 

Here p = 2. Let r, = 0, r 2 = r, T a = T„ = 2v/W„ the maximum 
sample interval (smallest sample rate allowable for each pulse train). The 
sample times of the first train are then nlsr/Wf, while those of the second 
train are {n2n/W f ) + r. Figure 2.7-2(a) and (b) show the two trains of 


t Note that we have assigned a gain T s to the filter for convenience. 
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Figure 2.7-2. (a) and (b) are sampling pulse trains used to produce the second- 
order sampled signal of (c). The message reconstruction method is shown in (d) 
[ 6 ], 
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sampling impulses, where we define T s = TJ2. The composite sampled 
signal is 

fft) = £ f(nT„)b(t - nT„) + £ f{nT„ + t)8(/ - nT sl - r), (2.7-4) 

which is illustrated in (c). . , 

For a real time function /(r) the necessary reconstruction filter impulse 

responses are [2] 


MO = 




cos(W/t/2) 


(2.7-5) 


ri w i j W f t sin(W f T/2) 

MO = M~ 0- (2 ' 7 " 6) 

The receiver reconstruction of /(/) is illustrated in Fig. 2.7-2(d). 

Probably the main advantage of second-order sampling of real lowpass 
time functions is that a nonuniform sampling may be used. 


Consider thl Special case of second-order sampling where the time difference between 
sampling pulse trains is 

Li-T = — 

T ~ 2 ’ W/ 

In this case the composite train of pulses is uniformly separated by the sampling 
period T„. and one might suspect that second-order sampling would reduce to first 
order. The filter responses (2.7-5) and (2.7-6) reduce to 

MO = MO = ^f 1 = SaW. • 

When this expression is used in (2.7-3) the result is found to be the same as (2.1-9) 
when T s = ir/W f . - — 


Second-Order Sampling of Bandpa** Signals 

' The advantage of second-order versus first-order sampling of bandpass 
waveforms is that the minimum (Nyquist) rate of sampling is allowed for 
“V «**? <* w. a* Furthermore^ the 

Again letting r, = 0, r 2 = r, 7 S , - l Sl - "/> Ulv 

signal is given by (2.7-3) with [2] 

coslmWfT - (Wp + W r )t] - cos {mWfr - [(2m - 1 Wj _ — 
hM ~ Wft sin(mW/r) 

cos{[(2m - l)Wrr/2] - [(2m - l)Wf - W 0 ]t} ( 2 . 7 - 7 ) 

+ W f t sin[(2m - l)W/r/2] 


. 
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Wft sin[(2m - l)W/r/2] ’ 

MO = M-0. (2.7-8) 

Here m is the largest integer for which (m - 1 )W f < W Q and r is arbitrary, 
except that it may not be an integral multiple of ir/W f unless (m - l)W f 
= VP 0 [2]. In the latter case WJW f is an integer, and a development based 
on first-order sampling can be used to give 

m = £ KnT,)h(t ~ nT s ), (2.7-9) 

n =» — oo 

h(t) = — {sin (mWft) - sin[(m - 1 ) W r r] } (2.7-10) 

7 Tt 

where samples are independent. 

We observe that (2.7-10) agrees with (2.6-2) if we allow for the fact 
that (m - 1 )W f = W 0 . 

Lowpass Sampling Theoram in Two Dimensions 

As a final topic in sampling theory we shall consider the uniform 
sampling theorem in two dimensions for lowpass functions [22]. Although 
the theorem has little application to ordinary radio communication systems, 
it is important in optical communication systems, antenna theory, picture 
processing, image enhancement, pattern recognition, and other areas. 

It is difficult to state a completely general theorem owing to factors 
which we subsequently discuss. t However, we state the most useful and 
widely applied theorem. 

A lowpass function/)/, u), bandlimited such that its Fourier transform F( o>, cr) 
is nonzero only within, at most, the region bounded by |o>| W fcl and 
|cr| =£ W fa , may be completely reconstructed from uniform samples separated 
by an amount T sl *£ ir/W fa in / and an amount T su it/W fa in u. 

In following paragraphs we shall prove this theorem and discuss some 
additional fine points. In particular, we shall find that 

fit, u) = 2 S f( kT «> nT su ) Sa ir(jr ~ ^jsaj^ir^ - nj , 

(2.7-11) 

which is the two-dimensional extension of the one-dimensional theorem. 
Although we only consider two dimensions and sampling on a rectangular 
lattice, the theorem can be generalized to N-dimensional Euclidean space 

f Recall that, even in the one-dimensional case, the lowpass theorem was not the most 
general theorem which could have been stated. 
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with sampling on an N - dimensional parallelepiped [22] as well as other 
lattices [23]. It can also be extended to nonuniform samples L24J. 

The proof of (2.7-11) amounts to postulating the function 

f(t, u) = 2 2 /(**> “«)*(* ~ tk,u - u n ), (2.7-12) 

it* — oo n* — 00 

WherC 4 = «•.. (2-7-13) 

„ =nT (2.7-14) 

— rll su i 

and finding an appropriate function hit, u) which will make (2.7-12) true 
for bandlimited f(t, u). The function hit, u) is called an interpolating function 

and as will be found, its solution is not unique. 

’ We first extend the definition of the delta function to two dimensions 

as follows 

f f <#>(£ - r)) 8 (f - x,r\ - y) d^ dr, = 4>ix, y), (2.7-15) 

J — OO — OO 

where v) is an arbitrary function, continuous at (*, y). Now using the 
fact that 

5(£ - x, r, - y) = ^ ~ x)S(v ~ y), (2.7-16) 

we may apply (2.7-15) to write (2.7-12) in the form 


fit , u) = 2 2 f J* 

• - t 

/" oo /*oo 

= I I A€,vM 

J — oo — oo 


/(£, T))/l(t -i,U -V) 


■ S(i - t k , V - u„)dfdr) 

M, v)h(t - (,U- rj) 2 8 (f “ **) 


• 2 8 ( t ) - «„) dr), 

(2.7-17) 

if it is assumed that the order of integrations and summations may be 
reversed. It is known that ^ 

m-kTJ-jri 0 - 7 - 18 > 

where the sample rate in t is 

= 2,/T.. (2-7-19) 

Bv defining the sample rate in u as 

_ wr (2.7-20) 

w JU = 2 tt/ i su , v 


i 
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an expression similar to (2.7-18) can be written. Substitution of the two 
expressions into (2.7-17) gives 

At, u ) = 



h(t - t,u - ri) 


1 

Ts,T su 



2 


/(f, rj)e Jka " (+J "°“‘ r> d£ dr,. 


(2.7-21) 


This expression is recognized as the two-dimensional convolution of 
the two functions hit, u)/T„T su and 


fit, u) = 2 2 At, (2.7-22) 

— oo n * — oo 

If the Fourier transform of / s (t, «) is defined as F s ico, cr), the frequency 
shifting property of Fourier transforms gives 


Ffco, cr) = 2 2 f(w - feu,, , cr - nco su ). (2.7-23) 

k = — oo n ” — oo 

Next, we recognize that the Fourier transform of a convolution of two 
functions in the t, u domain is the product of individual transforms in the 
a), cr domain. The transform of (2.7-21) is then 

Fico, cr) = £ 2 F(o> - kco sl , a - ncoj, (2.7-24) 

*St*SU ife*-oon*-oo 

where Hi o>, cr) is defined as the transform of hit, u ). 

This expression allows us to determine the required properties of the 
interpolating function hit, u). Figure 2.7-3 will aid in its interpretation. 
The bandlimited spectrum Fico, cr) is illustrated in (a). The double sum of 
terms in (2.7-24) is illustrated in (b). Clearly, if the right side of (2.7-24) 
must equal Fico, cr), two requirements must be satisfied. First, <o s , =* 2 W f „ 
and co su 3= 2 W fa . are necessary so that the replicas in (b) do not overlap, 
and second, the function Hi co, a)/T sl T su must equal unity over the aperture 
region in the co, cr plane occupied by F(co, cr) and must be zero in all regions 
occupied by the replicas. In the space between these regions Hi co, <t)/T s ,T su 
may be arbitrary. Hence there is no unique interpolating function in general. 
The first requirement establishes the sampling intervals stated in the original 
theorem. 

Regarding the second condition, we may ask what interpolating function 
should be used. There is no one correct answer. However, suppose we 
select 
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(a) 



Figure 2.7-3. (a) A two-dimensional, bandlimited Fourier transform and (b) its 

periodic version representing the spectrum of the sampled signal [ ]. 


his choice has the advantage of admitting all aperture shapes within the 
rescribed rectangular boundary. By inverse transformat.on 


’ m )= ( 2 ^ 

r« w/2 t rut-li 

_ Zk f *** du — I e j<ru da (2.7-26) 

2t 2t 


//(o), a)e J “' +i ‘ TU doo da 

" OJm/2 


= Sa 


Sal?. 


ly substituting this expression bank into (2.7-12), we have finally proved 
kp original theorem embodied in (2.7~11)» . , « « 

The interpolating function defined by either (2.7-25) or ( . - ) is ca 


the canonical interpolating function [22], It is orthogonal in the t, u space, 
which means that samples of fit, u) are linearly independent. 

For the rectangular sampling plan the canonical interpolating function 
may give 100% sampling efficiency. Let sampling efficiency rj be defined 
as the ratio of the area A a in the to, a plane over which F(w, a) is nonzero 
to the area A s of the rectangle defining the repetitive “cell” due to sampling 
[22], Since A s = w sl ui su , we have 



Aq 


(2.7-27) 


Efficiency is maximized by sampling at the minimum rates. By assuming 
this to be the case, A a is maximum for F(co, a) existing over a rectangular 
aperture. The corresponding efficiency is 77 = 1.0, or 100%. For comparison, 
it can be shown that maximum efficiency is 78.5% for either a circular 
aperture, or an elliptical aperture with major axis in either co or a directions. 


Example 2.7-2 

We note that the filter transfer function of (2.7-25) is arbitrary in the sense that the 
ideal filter can be more narrowband if W fu} < o>„/2 and W fa < <u,J2. We shall 
assume this to be the case and choose 

H( ca, a) = T„T tu rect(cu/ 2 W / Jrect(cr/2 W f f) 

so that a form of the sampling theorem that is more directly in the form of (2.1-9) 
can be stated. By inverse transformation 

hit, u) = ^Sa(ViV) ^^Sa(W>). 

7 T 7 T 

From (2.7-12) we have 

fit, u) = £ /(W„, nTJSa[W /u (7 - kT sl )]Sa{W f Ju - nTJ] 

77 k~-eo 77 (,= -00 

which is the two-dimensional form of (2.1-9). 


2.8 TIME DIVISION MULTIPLEXING 

As mentioned at the start of this chapter, one of the greatest benefits to 
be derived from sampling is that time division multiplexing (TDM) is possible. 
TDM amounts to using sampling to simultaneously transmit many messages 
over a single communication link by interlacing trains of sampling pulses. 

In this section we briefly describe time multiplexing of flat-top samples 
of N similar messages (such as telephone signals). The concepts are applicable 
to other waveforms to be developed in later work. 

The Basic Concept 

The conceptual implementation of the time multiplexing of N similar 
messages f n (t), n = 1, 2, . . ., N, is illustrated in Fig. 2.8-1. Sampled signals 
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Firnre 2.8-1. Time division multiplexing of flat-top sampled messages. Pulse 
trains of: (a) message 1, (b) message 2, and (c) the multiplexed tram. 


,ulse trains) for messages one and two are shown in (a) and (b). The pulse 
ain of (b) is delayed slightly from the tram of (a) to prevent overlap, 
ither messages are treated similarly. When the N total trains are combined 
nultiplexed), the waveform of (c) is obtained. The time allocated to one 
ample of one message is called a time slot. The time ^ter^ over which 
11 messages are sampled at least once is called a frame. The portion of 
he time slot not used by the sample pulse is called the 
?ig 2 8-1 all time slots are occupied by message samples. In a practical 
ystem some time slots may be allocated to other functions (for example, 
ignaling, monitoring, and synchronization). 


Suppose we'want to time-multiplex N = 50 similar telephone messages. Assume 
suppose wc wain r T , w _ 2 ~n 3)10 3 , and we must 

each message is bandhmited to 3.3 kHz. Thus / V 


sample each message at a rate of at least 6.6(10 3 ) samples per second. From practical 
considerations let us be limited to a sampling rate of 8 kHz and use a guard time 
equal to the sample pulse duration r. We find the required value of r. 

By equating 2r, the slot time per sample, to TJN, the allowed slot time, we 
get t = T S /(2N). But T, = 1/8 ms, so r = 10' 3 /(8 • 100) = 1.25 /rs. 


Synchronization 

To maintain proper positions of sample pulses in the multiplexer, it 
is necessary to synchronize the sampling process. Because the sampling 
operations are usually electronic, there is typically a clock pulse train that 
serves as the reference for all samplers. At the receiving station there is 
a similar clock that must be synchronized with that of the transmitter. 
Clock synchronization can be derived from the received waveform by ob- 
serving the pulse sequence over many pulses and averaging the pulses (in 
a closed loop with the clock as the voltage-controlled oscillator). 

Clock synchronization does not guarantee that the proper sequence 
of samples is synchronized. Proper alignment of the time slot sequence 
requires frame synchronization. A simple technique is to use one or more 
time slots per frame for synchronization. By placing a special pulse that 
is larger than the largest expected message amplitude in time slot 1, for 
example, the start of a frame can readily be identified using a suitable 
threshold circuit. 


2.9 SUMMARY AND DISCUSSION 

In this chapter we have demonstrated one most important fact, that a 
continuous waveform representing an information source can be completely 
reconstructed in a receiver using only periodic samples of the waveform. 
Thus the original waveform can be reformed, even at times between the 
samples, from just its samples. It is necessary only that the waveform be 
bandlimited (bandwidth W f ) and that instantaneous samples be taken at a 
high enough rate (the minimum rate 2 W f rad/s is the Nyquist rate). The 
sampling theorem was developed to prove this rather remarkable fact. 

In practice, waveforms are never bandlimited to have zero spectral 
components outside the band W f . However, there is always some frequency 
above which spectral components are negligible and can be approximated 
as zero; this practical frequency becomes W f , the signal’s spectral extent. 

For baseband waveforms the sampling theorem is developed for both 
deterministic (Sec. 2.1) and random, or noiselike, waveforms (Sec. 2.2). 
Even if samples are not instantaneous, sampling theorems are shown to 
apply when using practical sampling techniques (Secs. 2.3 and 2.4) and 
when using practical reconstruction methods in the receiver (Sec. 2.5). 
Sampling theorems for bandpass signals are also given (Sec. 2.6) and gen- 
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eralized theorems are stated for both baseband and bandpass waveforms 
(Sec. 2.7). 

By interleaving samples of several source waveforms in time, it is 
possible to transmit enough information to a receiver via only one channel 
to recover all message waveforms. The technique, called time division 
multiplexing, is briefly discussed (Sec. 2.8). Time multiplexing is one of 

the principal applications of sampling. 

In many modern-day communication problems, the information wave- 
form is analog, whereas the communication system is digital. The sampling 
theorem forms the basic theory that allows the waveform to be converted 
to a form suitable for use in such systems. We continue to develop these 
ideas in the next chapter. 
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PROBLEMS 

2-1. A signal /(f) = A Sa (co f t) is bandlimited to <o//2ir Hz. From the sampling 
theorem justify that only one sample is adequate to reconstruct /(f) for all 
time. 

2-2. The spectrum of the signal of Prob. 2-1 is F(co) = (A7r/a) / )rect(«j/2(u / ). /(f) 
is applied to a filter with transfer function 

J/W = rect (^J[ 1 + | fl „ cos (^)]. 

(a) Inverse transform the filter’s output spectrum to show the effect of 
distorting the signal’s spectrum, (b) How many samples are required if 
the filter’s output signal is to be reconstructed exactly from its samples? 
(c) What is the effect of replacing the cosines by sines? 
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2-3. Begin with (2.1-14) and show that the spectrum of the ideally sampled signal 
is given by (2.1-19): 

1 “ _ 2ir 

F,(o>) = - 2 F(a - <»>,). " t,' 

2 4 A nonrandom signal, handlisted to 17.5 kHz, is to be reconstructed exactly 
’ from its samples. It is sampled by ideal sampling and it is recovered in the 

Sv« “ » ktal lowpass filter, to) « U ■* 

allowed 9 (b) At what minimum rate must the signal be sampled. 

2-5. A system transmits ideal samples of its input message. 
is 25 kHz and the receiver uses an ideal lowpass filter 

The svstem is to be used with a nonrandom message bandhmited to 13 kHz. 
?an this^nessage be reconstructed without "on? If 
discuss how you might send the message over the system lw.th changesh 
2-6. Show by application of the lowpass sampling theorem that the bandhmited 
signal 

_ TTCO&(Wft) 

f(t) = (w/2 ) 2 - WO 2 

is the sum of two sampling functions separated in time by the sampling time 
T, = nlW f . (Hint: Choose TJ2 and -TJ 2 as the sample times.) 

2 8 A°message < has a bandhmited spectrum F(o>) = 10 trifai/tVy). & 1S ld ^ 

the '» 'ZZZfL* 

.h. fi.no - %■ • • - “ “, h ™ f “ do ; n 

the filter attenuates the edge of the first replica components at M - «, W f . 

„ I ,et fit) be bandhmited to - W, « « « and have the sampling representation 
of (2 1-9) (a) Find the bandwidth of the response g(t) of a square-law devic 
wheri g(t) -AO. (b) Write an equation for the sampling representation of 

/ 2 (f) in terms of samples of /(/). 

2-10 A signal /(f) is bandhmited to 5 kHz; it is sampled at a very high rate of 
100 kHz using ideal sampling (impulses). At another location thesampled 
Itenal is filtered by an ideal bandpass filter that passes a band of 15 kHz 
wfdth centered on 100 kHz. (a) Write an equation for the exact filter response 
signal, /„(f). (b) Discuss the form of the signal /»(r). 

_ „ . ■ . / _ f, u ( t )exo( - 3/) has a Fourier transform F,(co) - op + jm ■ 

iSy rn is ncU bandhmited . (a) Find the bandwidth of an idea, fiber 

that will allow the filtered signal, denoted by /(f), 

energy in /,«). (b) What is the minimum aUowable rate that fit) ca 

sampled without aliasing? 

2-12. Work Prob. 2-11 for the signal 

/,([) = 3 tu(t)e' u ** F\(a>) = 3[4 + Jw] ■ 

2-13. Work Prob. 2-11 for the signal 

/,(t) = 2 e' 51 ' 1 ** F ](<i>) = 20[25 + cu ] . 
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2-14. A signal is given by /(f) = 3cos 3 [2w(10 4 )/]. (a) Find the spectrum F( a>) of 
f(t). (b) What is the Nyquist rate for ideal sampling of /(f)? 

2-15. (a) Determine the minimum (Nyquist) sampling rate for the signal /(f) = /,(f) 
+ / 2 (f) where /(f) is bandhmited to W,(rad/s) and/ 2 (f) is bandhmited to 3W, 
with W, > 0 a constant, (b) Discuss how the minimum rate of part (a) 
compares to the respective minimum rates of samphng/(f) and/ 2 (f) individually. 

2-16. How would the conclusions of Prob. 2-15 change if/,(f) and/ 2 (f) were replaced 
by their respective squares, /5(f) and /5(f)? Discuss. 

2-17. A lowpass bandhmited signal /(f) has energy Ef. Find £/ in terms of the 
ideal samples of /(f) taken at the Nyquist rate. [Hint: Use (2.1-11).] 

2-18. The signal /(f) of Prob. 2-14 is ideally sampled at a 60 kHz rate and the 
sampled waveform is apphed to an ideal lowpass filter with bandwidth 30. 1 
kHz. Find an equation for the filter’s response. Is there aliasing? Explain. 

*2-19. A rectangular time function /(f) = A rect(f/T) that is not bandhmited is 
ideally sampled at a rate co 5 = 2W/, where W s is the bandwidth of an ideal 
filter to which the sampled signal is apphed. (a) Find W/SO that the fractional 
aliasing error is 1/12. (b) How many samples occur over the duration of 
/(f) for the sample rate to, = 2W f ? (Hint: Use /$ Sa 2 (f) dg = Si(2x) - 
x“'sin 2 (x) where the sine integral /» Sa (f)rff = Si(ot) is a tabulated quantity 
[25].) 

2-20. A nonrandom signal /(f) has an energy density spectrum % f (a>) = 24o> z [l + 
(to/W) 2 ]' 2 , where W > 0 is a constant. If /(f) is ideally sampled at a rate 
a> s = 5W, what fractional aliasing error occurs? 

2-21. A nonrandom signal /(f) has an energy density spectrum % f (u) = A[1 + 
(w /W) 2 ]' 1 , where W > 0 is a constant. At what rate must /(f) be ideally 
sampled if the fractional aliasing error is to be 0.01 (or 1%)? 

2-22. A waveform exists only over a total time interval of 80 p.s. If its spectrum 
is to be represented exactly by its samples taken every W,(rad/s) apart, what 
is the largest that W s can be? 

2-23. A baseband noise waveform is reconstructed exactly from its samples taken 
at a 60- kHz rate. If samples occur at three times the Nyquist rate, what is 
the spectral extent of the noise? 

2-24. A message /(f) = 6 cos(10 } f) is naturally sampled at a rate w, = 2.5(10 3 ) 
rad/s using pulses of duration 60 p.s. The amplitude of the sampling pulse 
train is K = 2 V. (a) Write an expression for the response of a receiver’s 
ideal filter (lowpass) of bandwidth W f = 1010 rad/s to the train of samples, 
(b) If W f is changed to 990 rad/s what is the response? 

2-25. Find and plot the spectrum of the sampled signal of Prob. 2-24. 

2-26. Show that if a message is a sinusoid of frequency u> f that the ideal filter in 
the receiver requires no equalization for distortion-free signal recovery so 
long as the sampling rate exceeds 2 ay and the filter’s bandwidth is slightly 
larger than <o f . What then is the effect of leaving out equalization? Assume 
fiat-top sampling. 

*2-27. Assume flat-top sampling and find and plot the magnitude of the equalizing 
filter’s transfer function to follow a lowpass filter for message recovery when 
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the sample pulse shapes are as given. 


(a) pit) = 


(b) pit ) = 


1-1* 


TTt 


A cos ( — ), 

T„ 


lo. 


elsewhere 

w <T i 

elsewhere 


.It 

(c) pit) = A tn^— 

2-28. Start with (2.4-1) and show that the spectrum of s„(t) may be written as 


£„(&>) = to, X P(no>,)8(a> - noj s ), 

n = ~oo 

where Pico) is the Fourier transform of pit). 

2-29. Show that C 0 of (2.4-4) equals Kt/T, with r given by (2.4-5). 

2-30. Pulses of the form given in Prob. 2-27(b) are used to construct a periodic 
sampling pulse train for natural sampling of a signal bandlimited to W f . What 
smallest pulse duration t p is required if t p is adjusted so that the spectral 
replica at l<n s is to disapppear (be made zero) in the sampled message when 
sampling is at the Nyquist rate? Sketch the spectrum of the sampled signal. 

2-31. Find the durations of the equivalent rectangular pulses of the waveforms of 
Prob. 2-27(a) and (b). 

2-32. Show that (2.5-4) follows from Fourier transformation of (2.5-2). 

2-33. Plot \H tq i<o)\ of (2.5-5) for 0 « <o « W f when Hico) is given by (2.5-4) with 
a), = 6 W f . How much does \H, q ica)\ vary over the band? Would you consider 
the equalizer necessary in practice for this problem? 

2-34. The spectrum F(io) of a signal fit) is given by 

/a>-25\ „ ./a> + 25\ 

Fitt) = 8 tn(-y-j + 8 tn^— j— J ■ 

(a) Sketch the spectrum of the ideally sampled version of/(r) for sampling 
rates oj, = 20, 25, and 50 rad/s. (b) Can fit) be recovered exactly from any 
of the three sampled signals in (a) by a bandpass filter that passes the band 
20 =£ M =£ 30? If so, state which ones. 

*2-35. A bandpass nonrandom signal is bandlimited such that WJW f = 2.8 and 
W f /2ir = 10 4 Hz. It is directly sampled. What is the smallest allowable 
sampling rate if perfect recovery from its samples is to be achieved? 

*2-36. An engineer has a bandpass nonrandom message /(f) bandlimited to a bandwidth 
of 6 MHz. He wants to transmit ideal direct samples of fit) at the Nyquist 
rate, (a) If he has the freedom of choosing only the center frequency of fit), 
what is the smallest and the next three higher frequencies that he may select? 
(b) If he chooses the lowest allowable center frequency, is the waveform 
being sampled still bandpass? 


*2-37. A signal/)/) = 3 cos(100/)cos(350/) is to be sampled at the smallest allowable 
rate, (a) What is the sample rate? (b) Sketch the spectrum of the (ideally) 
sampled signal and verify that a bandpass filter passing the bands 250 *s 
|o>| s 450 rad/s will recover fit) without distortion. 

*2-38. A bandpass signal, bandlimited to a bandwidth W f , is directly sampled at a 
minimum rate 30% higher than the Nyquist (minimum possible) rate. Find 
the largest value that is possible for WJW f . How many values of WJW f 
are possible? [Hint: Use (2.6-6).] 

*2-39. Show that (2.6-13) is equivalent to quadrature sampling of fit) if samples of 
fiit) and f Q (t) occur at the same times and sampling is at the minimum possible 
rate. [Hint: Use the product property of Hilbert transforms. It states that 
the Hilbert transform of a product f(t)c{t) is fit)c{t) if /(/) is lowpass, bandlimited 
to |fe>| =s W/, and c(t) has a nonzero spectrum only for |a>| > IV/.] 

*2-40. Let N[t) be a zero-mean, wide-sense stationary noise bandlimited to W 0 =s 
|a)| =s W 0 + 1V N and have the representation of (2.6-17) with &> 0 = W 0 + 
iW N /l). (a) Show that the cross-correlation functions R NtN ,ir) and R n , Nc (t) 
are both bandlimited to |a>| =s W N / 2. [Hint: Assume (B.8-3e) applies and 
prove (B.8-3g) is true.] (b) Show that the cross-correlation functions are 
both zero when r = 0. 

*2-41. Prove that Nit), given by (2.6-18),_equals N(r) of (2.6-17) with zero mean- 
squared error; that is, prove that e 2 = E{[Nit) - Nit)] 2 } = 0. [Hint: Use 
results of Prob. 2-40.) 

*2-42. Use (2.7-5) and (2.7-6) in (2.7-3) with p = 2, r, = 0, r 2 = r and T,. = 
T Si and allow r -» 0. Show that [2] 

fit) = 2 { finT ,,) + (/ - nT,)f(nT,)} Sa 2 

n - -oo 

in the limit, where/)/) = dfit)/dt. This result shows that /(/) can be recovered 
from a sequence of its samples and samples of its time derivative, each taken 
at a rate cd S| = W f , or half the Nyquist rate. The average number of samples 
per second still equals the Nyquist rate, however, because there are two 
samples being taken in each sampling interval. 

*2-43. A lowpass function fit, u) has a Fourier transform F(o>, cr) that is nonzero 
only over a diamond shaped region having two of its points on the w axis 
at ± W fu , and the other two points on the cr axis at ± W fc . (a) If sam- 
pling rates are = 3 W fu and oi s , = 2.31V/„. find the sampling efficiency, 
(b) To what value will sampling efficiency increase if both sampling rates 
are reduced to the smallest allowed values? 

*2-44. Work Prob. 2-43 except let F(id, cr) be nonzero over an elliptically shaped 
region in the <n, <j plane. 

*2-45. Work Prob. 2-43 except assume F(a>, <t) / 0 only over a circular region in 
the cd, cr plane with radius W fa = W fa = W f . 

2-46. Devise a method of interlacing samples at one summing junction so that 
Nyquist rate sampling of five signals occurs when four signals are bandlimited 
to 5 kHz and one is bandlimited to 20 kHz. 


Wfjt - nT„) 
2 
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2-47. Work Prob. 2-46 using two summing junctions, one that sums samples of 
only the four 5-kHz signals and a second that multiplexes only samples of 
the 20-kHz signal and the output line of the first multiplexer. Discuss syn- 
chronization of the two multiplex operations. 

2-4*. A TDM system uses flat-top sampling pulses 0.7 ix s wide. If a guard time 
of 0.34 ix s is allowed and N = 120 telephone messages are multiplexed, 
what is the largest allowable message spectral extent? 


Chapter 3 


Baseband Digital 
Waveforms 


3.0 INTRODUCTION 

After the somewhat extensive discussions in the preceding chapter on sampling 
theorems, it seems appropriate to pause and place the direction of future 
developments in perspective. Again we refer to Fig. 1 .3-1 .t 

The overall purpose of this chapter is to describe the various functions 
of Fig. 1.3-1 as they relate to a baseband digicom system. We shall be 
concerned primarily with the analog-to-digital (A/D) converter, source and 
channel encoders, and the generation of the transmitted waveform (in the 
modulator). Discussions of the receiving subsystem functions are brief 
because they are basically just the inverse operations to those in the transmitter 
subsystem. When the transmitter functions are developed, the use and 
implementations of the inverse operations should be more or less obvious 
to the reader. 

Our discussions ultimately lead to the description of a number of 
important digital waveforms for transmission over the channel. However, 
because one of the significant advantages of a digicom system is its ability 
to interlace in time, or time-multiplex, digital waveforms of many messages, 
we also introduce the basic elements of this technique. 

t Other functions are sometimes included in the digicom system. Later we mention 
time-multiplexing of many waveforms. Other functions, such as data encryption and frequency 
spreading ( spread spectrum), are not covered here. Sklar [1] has given a good summary of 
these functions. 
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Because the digital conversion of analog signals is the only operation 
in the transmitter subsystem that involves analog waveforms and since the 
preceding sampling theory forms the initial part of this operation, we begin 
with this topic. All succeeding discussions can then deal entirely with digital 
concepts. 


3.1 DIGITAL CONVERSION OF ANALOG MESSAGES 

The A/D conversion of an analog message involves first sampling the message 
and then quantizing the samples. We assume the message/(f) to be bandlimited 
to W f (rad/ s) so that it can be reconstructed without error from its samples, 
according to the sampling theorem (Chap. 2), if samples are taken at a rate 
Wf /- tt (samples/ s) or faster. Thus sampling produces no error in the re- 
construction of the message, in principle. However, the process of quan- 
tization, the rounding of sample values to give a finite number of discrete 
values, discards some information present in the continuous samples, and 
the reconstructed signal can be only as good as the quantized samples 
allow. In other words, there remains some error, the quantization error, 
that is related to the number of levels used in the quantizer. 

Quantization of Signals 

Let the analog message /(/) be modeled as a random waveform. Define 
the probability density function (Appendix B) of f(t) at any given time t as 
£>/(/). A possible function is illustrated in Fig. 3.1-1. In (a) we have a 
message that possesses definite extreme values, whereas the message of 
(b) has no well-defined extremes, such as a Gaussian signal. 

The process of quantization subdivides the range of values of/ into 
discrete intervals. If a particular sample value of f(t) falls anywhere in a 
given interval it is assigned a single discrete value corresponding to that 
interval. In Fig. 3.1-1 the intervals fall between boundaries denoted by 
fhfit- ■ ■ ,/l+i, where we assume L intervals. The quantized (assigned) 
values are denoted by l lt . . . , l L and are called quantum levels. The 
width of a typical interval is/ +1 - f and is called the interval’s step size. 
If all step sizes are equal and, in addition, the quantum level separations 
/■+ 1 ~ If are all the same (constant), we have a uniform quantizer; otherwise 
we have a nonuniform quantizer. 

If all values of a message do not fall in the range of the quantizer’s 
intervals, which in Fig. 3. l-l(b) is from/, to f L+l , these values saturate or 
amplitude overload the quantizer/ The message of (a) does not overload 

t A practical quantizer for this type of message must be designed purposely to allow 
a small, controlled amount of overload. 
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(a) 



Figure 3.1-1. Probability density functions and quantization levels for (a) messages 
with well-defined extreme values and (b) those without such extremes. 


the quantizer but could if the quantizer levels were established on the basis 
of a certain message power level and then the incoming message’s power 
suddenly increased. An increase in message power corresponds to an at- 
tendant spread in the density function p f (f). Thus a quantizer must be 
designed not only for a form of message density but for a specific density 
that results when the message has a design power lev el. W e call this power 
level the signal’s reference power level, denoted by fl{t). 

Even when /(/) does not overload the quantizer, there is still error — 
the rounding, or quantization, error — associated with every interval of the 
quantizer. Consider an exact sample value / that falls in interval i, that is, 
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f^f **/+ 1 • The quantizer output is level /,, which differs from the exact 
sample value by the error 

«,=/-/„ i — 1, 2,..., L. (3.1-1) 

Once quantized levels /,• are generated and transmitted by whatever system 
is used, the best that any receiver can ever hope to do is recover these 
levels exactly. Actual message values can never be recovered. When a 
receiver reconstructs the message from its quantized levels, it will contain 
errors related to the various errors e, that occur during quantization. These 
errors place a limit on the performance of the overall system. 

Quantization Error and Its Performance Limitation 


We are interested in finding an overall mean-squared quantization 
error, denoted by e\ , which results from quantization. Let P/(f) represent 
the probability density function of f{t) and consider a quantizer defined by 
intervals and levels shown in Fig. 3.1-1. The mean value and power in the 
analog signal can be written ast 


l rft+\ 

M =7= / fpAf)df=2 / fpAf) df (3.1-2) 

- “ i-1 •'ft 

= f fpAf) df= 2 f fpAf) df. (3.1-3) 

The quantizer output for any one sample can be treated as a discrete 
random variable, denoted by f q , that has possible levels l u l 2 , ... , l L . 
The probability that f q will have a typical value, say /,, is just the probability, 
denoted by P,, that /falls in interval i. The mean value and power present 
in the quantizer output are obtained by averaging the discrete random 
variable f q over all its quantum levels: 


where 


l l rft*\ 

T t -2hP,-2i, pAf)df 

i=i i=i J f 

__ i l rfi* \ 

I pAf) df 

i-i ,-i 

rn*\ 

P, = I PAf) df- 

•J r, 


PAf ) df 


(3.1-4) 


(3.1-5) 


(3.1-6) 


Of course, it would be desirable for the mean value of the quantizer 


t Interval boundaries /, and f L+l are shown finite for ease of writing equations. All 
practical quantizers assign levels /, and l L when / s / and f L < /, respectively; this result is 
equivalent to setting/, = -<*> and/ t+ , = ■», which we assume in some of the following 
work. 
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output to equal the mean value of the analog input signal. If we require 
this to be true, (3.1-4) must equal (3.1-2). By solving the equality we have 

rfi-i rfi * 1 

/, I pAf)df= I fpAf) df i = 1, 2 ,..., L, (3.1-7) 

J Si J f, 

or 

/ fpAf) df 

l > = ' = >. 2, L. (3.1-8) 

I PAf) df 

In other words, the quantizer should choose its quantum levels equal to 
the conditional mean values of each interval. 

We are now able to find the mean-squared quantization error. From 
(3.1-1) the mean-squared error when the sample value falls in interval i is 

J rf<+i 

' (/ - l.fpA/V) df (3.1-9) 

fi 

where p f (f\i) is the probability density of / given that / falls in interval i. 
It is given by 

P//|0 = £ ^. (3.1-10) 

By averaging these mean-squared errors over all intervals we obtain the 
overall mean-squared error 

l l rfi+i 

^ = S 5 P, = 2 I (/ - hfpAf) df (3.1-11) 

i=i /-i 

By expanding the right side of (3.1-11) and substituting (3.1-7) and 
(3.1-5), we have 

7 q =f 2 -7l- (3-1-12) 

In other words, the overall quantization mean-squared error equals the 
difference between the power in the analog signal input to the quantizer 
and the average power in the quantizer output. 

To gain insight into quantization error and to interpret (3.1-12), we 
note that sampling and quantizing samples is equivalent to quantizing first 
and then sampling. By using the latter viewpoint we construct a possible 
quantized waveform prior to sampling, as illustrated in Fig. 3.1-2. For 
illustrative purposes we assume a uniform quantizer with step size Sv. 
Clearly, sampling the quantized message f q {t) is the same as sampling the 
waveform 

ff) = /W + e,(f) (3.1-13) 

where - e q {t) is an equivalent quantization error waveform having sample 
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3.2 DIRECT QUANTIZERS FOR A/D CONVERSION 
Uniform Quantizers — Nonoptimum 

We next examine a quantizer with a large number of quantum levels 
having constant step size Sv = f +l - f, i = 1, 2, .... L. In the next 
subsection we shall return to this uniform quantizer to discuss its performance 
with a smaller number of levels and see how it can be optimized by minimizing 
its mean-squared error. For the present discussion there will be L levels 
centered in equally spaced intervals between two extreme values / min and 
/ max . Thus 

8t> = /max ~ /min (3. 2 -l) 


h fmin I * 


; = 1,2,...,!. 


(3.2-2) 


Mean-squared quantization error, from (3.1-11), becomes 

_ r/m i» rh 

4= (f-h ) 2 P/f) df+ / (/ - /, ) 2 p f (f) df+ ■■■ 

•- (3.2-3) 

/“/max /*“ 

+ / (/ - k) 2 Pf(f) df + / ( f-l L ) 2 PAf)df ; 

The first and last terms represent overload noise, t which we denote by 
. If Sv is small enough (large L) so that pff) ~ constant = p//,) for 
each interval and N qi „, is relatively small, it is easy to show that the middle 
terms in (3.2-3) are approximately equal to (Si;) 2 / 1 2 (see Prob. 3-5). Hence 

-j ( Sv ) 2 

< = TT + N o.°‘> (3.2-4) 


where 


N„, ol = f (f ~ l,) 2 P/(f) df + f (f- k) 2 p f (f) df. (3.2-5) 

00 /max 

Signal-to-quantization noise power ratio becomes 

(s 0 \_i2m i 


(Su) 2 1 + [12/V,, o/ /(Su) 2 ]' 


(3.2-6) 


For a given message power p(t), (3.2-6) clearly shows that performance 
without overload is better for smaller step size Sv. At the same time the 


f Note that errors relative to level 1 corresponding to < / s / 2 are not treated as 
overload errors. Only errors relative to /, corresponding to/ =s / min are overload. Thus interval 
1 is broken into two parts for convenience in defining overload. A similar division of interval 
L has been adopted. 
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relative effect of a given overload noise power N q ol is enhanced by smaller 
step size. 

A special, but very useful, case of (3.2-6) can be given for messages 
having a definite maximum amplitude. If the message has zero mean with 
a symmetric probability density function and crest factor K cr , we let 
1/rMlmax , the maximum magnitude of /(f) at its reference power level, t 
establish the extreme range of the quantizer. In this case we have 

,(rnin ./max l./r(0|max tUld 

SU = ^0^ = ( 2 4-)v'— (3.2-7) 


So) 3l 2 r ad ' _ 

N J K i l flit) J 1 


+ [3 L 2 N q ,JKl r fM 


(3.2-8) 


Here 7V,, 0/ = 0 if/lfj ^flit). 


Example 3.2-1 

We let f(t) be uniformly distributed on (-|/(0|m«, 1/WlmJ and solve (3.2-8). Message 
power is 


^ r- 2 _4_ m \f^ 

J ’ J-mJ 21/L, 3 ' 


Reference level power is then flit) = |/,(r)|L,/3. Hence K], = 3. As long as 
flit) 3* f(t), we have N q , 0 , = 0 so 


Sf = L t A A . 

N J [flit) J’ /'(O 


From (3.2-5) we have 


/ “l/rlnux T 5 *12 

/+ I/.U.X-T : 
l/lmo L 2 J 

♦/"“[/- w_.sl 

J \Mm, L 2 J 


df 

2|/Lx 

2 df 

2|/Lx’ 


l/lmax > l/r|m 


Solutions of these integrals are straightforward. By using the fact that 2[f r (t )\ m „/ 
Sv - L, (3.2-8) becomes 




2 - 3 L + 3 ) 
3 L l 


t Reference power level is th e maxi mum power level for which the quantizer does not 
saturate, that is, N,,„, = 0 for f(t) «/, 2 (r). Crest factor for a signal f(t) is defined by K], = 

i/WILx/At). 
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for R > 1, where we define 

* 2 =m. 

fin) 

A plot of iSJN q ) is shown in Fig. 3.2-ljor L = 16 and 128. Note that iSJN q ) 
decreases abruptly as message power f\t) exceeds the reference (design) level 
flit), which corresponds to the onset of amplitude overload. 

The previous example served to illustrate the use of (3.2-8), which 
applies to messages having a well-defined maximum magnitude. For messages 
with no such maximum (Gaussian, for example), one can still use (3.2-6) 
when L is largo. However, there now exists the problem of where to set 
the extreme quantum levels (/, and l L ) in relation to the probability density 
function of the message when it is at some reference power level. For a 
given number of levels (either large or finite) too small a quantization range 
results in excessive overload noise power. A large range gives less saturation 
but gives increased in-range quantization error. These extremes suggest 
that an optimum design may exist somewhere in between that will minimize 
overall quantization error power. We next consider this problem. 



frHt) 


Figure 3.2-1. Performance of a uniform quantizer for a uniformly distributed 
input message. 
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Optimum Quantizer* 

It is well known [3,4,5] that quantizers having nonuniformly separated 
quantum levels can be found that give smaller mean-squared quantization 
error, e 2 , than those having uniform levels. Equation (3.1-11) gives e\ for 
any quantizer and can be written as 

3- rv t\) 2 Pf(f)4f + z r (f-ifPAndf 

j -OO iml Jfi f'l 0_Q\ 


(3.2-9) 


+ f ( f-l L ?PAf)df- ■ 

J fL 


The optimum quantizer is one that chooses levels /, and interval boundaries 
/■ so that s 2 q is minimized. 

By applying Leibniz’s rule, (3.2-9) can be differentiated to obtain a 
set of necessary conditions that must hold for the optimum quantizer [5]: 


-2 ffnl 

t-- 2 l 


)P/(f) df = 0, 


i = 1, 2,..., L, 


(3.2-10) 


= [(/ - h-i) 2 - (/• ~ lifWf) = 0, / = 2, 3,..., L. (3.2-11) 


(3.2-12) 


where/] = - °o and f L + , = Equation (3.2-11) is equivalent to 

f = (f Y'-) . / = 2, 3,..., L, (3.2-12) 

which says that interval boundaries /■ should fall midway between the 
adjacent quantum levels. An alternative form for (3.2-12) is 

h = 2/ - i = 2, 3,..., L. (3.2-13) 

Equation (3.2-10) is readily solved for l, 


/P/(/) # 


■Jf, 


i = 1, 2,..., L. 


(3.2-14) 


p/f) df 


This result was previously found in (3.1-8) to be necessary for the mean 
value of the quantizer’s output to equal the mean value of the input message, 
a fact noted by Bucklew, et al. [6] and others [4, 7], 

The solution of (3.2-13) and (3.2-14) for the general nonuniform quantizer 
is difficult. However, a procedure to obtain a solution by computer iteration 
has been given [4, 5], For a specified message probability density p f (f) 
and fixed value of L, /, is first selected arbitrarily (it should be a value in 
the tail of p f (f) that corresponds to small/). With/, = we solve 
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(3.2-14) for / 2 . Next, / 2 and /, are used in (3.2-13) to obtain l 2 . The cycle 
is then repeated to obtain / 3 from (3.2-14) and l 3 from (3.2-13). Continued 
iteration finally stops when l L is obtained from (3.2-13). If /, has been 
correctly guessed, then l L will satisfy (3.2-14) with f L+i = If it does 
not, /, is corrected to a new value and the process repeated until l L satisfies 
(3.2-14). This procedure satisfies the conditions (3.2-13) and (3.2-14), which 
are necessary for optimality. Sufficient conditions have also been given 
for the equations to be the optimal set [8], Recently, more general work 
has proved the existence of optimal quantizers [9-11]. 

Max [5] used the above procedure to find the quantum levels for a 
zero-mean Gaussian message and all values of L from 1 to 36. Paez and 



Levels in [12] are given for L = 2, 4, 8, 16, and 32. For these optimum 
nonuniform quantizers, Fig. 3.2-2 illustrat es w here the highest quantum 
level l L is set in relation to the rms value [/;/)] 1/2 of the reference signal 
for which the quantizer is optimized. t The resulting signal-to-quantization 
noise power ratio is shown in Fig. 3.2-3. We note that the curves are not 
more than 3.2 dB apart for any value of L shown. Thus performance does 
not change radically for the densities illustrated. However, the required 
ratio of l L to [/?(') ] 1/2 , called the loading factor, changes greatly with both 
L and message type. 

The procedure of Max [5] can also be applied under a constraint that 
steps l, - are constant for i = 2,3, .... L. The result is an optimum 
uniform quantizer. Figures 3.2-4 and 3.2-5 illustrate loading factor and 
( SJN q ) for Gaussian, Laplace, and gamma densities. Performance now 


t The gamma density is a reasonable approximation to real speech messages [13, 12]. 
t Because of symmetry in h = -k- 
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Number of Quantum Levels, L 

Figure 3.2-2. Loading factor for optimum nonuniform quantizer for several types 
of message (after [5, 12]). 


depends more heavily on the type of message than in the nonuniform 
quantizers, but loading factor is not as sensitive to message type. 

Other comparisons of uniform and nonuniform optimum quantizers 
are given in [14]. For large values of_ L, an approximation of 
Mauersberger [15] for the minimum value of e, applicable to a nonuniform 
quantizer can be used to extend the curves of Fig. 3.2-3. We have 


S \ L 2 

Nj “ 1 + [a(p) - 1][1 - {N b + !)«-"*]’ 


(3.2-19) 


Pf(f) = 


(3.2-21) 


where 

N„ = log 2 (L) (3-2-20) 

, 27 1 '. [HI + \Mf ,3.2-21) 

1 m + 3/») 

and v is a real number that determines the form of pjif) through the 
generalized Gaussian distribution [15] 

.... _ vycxpj - Tjl/P 
PfU) im/v) • 


(3.2-22) 
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Number of Quantum Levels, L 


Figure 3.2-3. Performance of optimum nonuniform quantizer for several types 
of message (after [5, 12]). 


Here 



and r(x) is the gamma function [16]. When v = 1 and 2, (3.2-22) gives the 
Laplace and Gaussian densities, respectively. As v -* °o, p f {f) tends to 
the uniform distribution [15]. Equation (3.2-19) is claimed to be accurate 
to about 2.5% (or 0.11 dB) for v > 0.6 with accuracy degrading to about 
15% (or 0.61 dB) for v = 0.3. 

Example 3.2-2 

We shall use (3.2-19) to verify the performance of a nonuniform quantizer for a 
message having a Laplace density and L = 32 levels. Here v - 1, so (3.2-21) 




Loading Factor 



Number of Quantum Levels, L 


Figure 3.2-4. Loading factors for optimum uniform quantizer for several types 
of message (after [5, 12]). 



Number of Quantum Levels, L 

Figure 3.2-5. Performance of optimum uniform quantizer for several types of 
message (after [5, 12]). 


70 


Sec. 3.3 Companded Quantizers 


71 


gives 

, , 27H2) 27 27 

0(V) H4) 3! 6 ' 

From (3.2-20) N b = log 2 (32) = 5. Thus 

l 

= 234.943 (or 23.71 dB). 

[l - 6<r 5 ] 

The actual value in Fig. 3.2-3 is 23.83 dB, from [12]. The approximate result is 
then accurate to -0.12 dB if the result in [12] is assumed to be the true value. 

When the message has a uniform density it can be shown by considering 
(3.2-14) that the optimal quantizer is uniform. 

3.3 COMPANDED QUANTIZERS 

Compandors and Their Optimization 

Let us reconsider (3^1-1 1), the basic equation describing the mean- 
squared quantization error e\. Again, (3.2-13) and (3.2-14) become conditions 
that must be satisfied by any quantizer giving minimum mean-squared error. 
We now assume that L is large enough that pjif) is nearly constant in 
every interval and that the message at its reference power level has definite 
extreme values, so that /, « f r (t) « /*,+ 1 . This last assumption can be 
relaxed if pff) exceeds the extremes/, and/ i+1 with negligible probability. 

If A, is defined as the width of the quantization intervals— that is, if 
A, = f +l - f — we find from (3.2-14) that /, falls in the center of the 
interval. Thus 

/,♦. - h = f (3-3-1) 

f ~ l, = -f (3-3-2) 

for (' = 1, 2, . . . , L. With these results and our assumptions, (3.1-11) 
becomes 

3 = £ £ V - tfpAf) i pa) J f (f - f.) 2 a 3 

= ^ i ^ p/m,. 

At this point we observe that a nonuniform quantizer can be constructed 
by first passing the message through a suitable nonlinear network and then 
using a uniform quantizer [3, 7], Figure 3.3-1 illustrates the network’s 
characteristic, denoted by v(f), and the way in which input and output 
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which we seek to minimize by finding the optimum v(f). The necessary 
procedures are beyond our scope, t so we shall give only the results. For 
a symmetrical message where Pf(-f) = P/(f), the compressor characteristic 
has odd symmetry, v(—f) = — v(f), and the minimum e\ and optimum 
v(f) are given by [3, 4, 7, 18] 



vf oP y\odi 

V(f) = , 0 «/=£ V. (3.3-7) 

/ P'/\f ) df 
-'0 

Here V is a constant defining the extreme input-output levels according to 
~V=fi «/«/ i+ i = V, (3.3-8a) 

-V = v(f) « u(/ i+1 ) = V (3. 3 -8b) 

and 

c 2V 

= y- (3.3-9) 

To summarize, (3.3-7) is the compressor characteristic that gives the 
smallest possible mean-squared quantization error e\ , given by (3.3-6), when 
a large number of fixed levels is used. The message is assumed to have 
zero mean, a symmetric probability density, and to be bounded such that 
l/Wl < V. The quantizer’s boundaries are set at ± V according to (3.3-8). 
If the message can have amplitudes of magnitude larger than V, the levels 
-V, V need to be set so that the probability of these amplitudes giving 
overload is negligible. 

Of course, the minimum valu e of e 2 q occurs only for the message’s 
reference (design) power level f 2 r (t). If message power f 2 (t) falls below or 
above the reference _value the quantizer is said to b e varia nce (power) 
mismatched [19] and e- incre ases. T he in crease for f\t) f;(t) is not nearly 
so severe, however, as when f-(t) > f 2 r (t) occurs. Shape mismatch can also 
occur if p/{f) does not have the same shape (form) as that assumed in 
quantizer design. Shape mismatch is most serious when p/(f) has tails 
that are larger than those for which the quantizer is designed (for equal 
message powers). 

We shall consider an example to demonstrate some of the above points 
and to illustrate one of the disadvantages of the optimum quantizer. 

t The procedure uses the method of Lagrange multipliers, which leads in our case to 
the application of the Euler-Lagrange condition [17, pp. 627-633]. 
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Example 3.3-1 

The Laplace density 

i /-Vil/A 

M/) ' V 7 “ P \ v? J' 

where T 1 = f\t), is a fairly representative model of the statistical characteristics of 
a speech message. We find the optimum compressor and mean-squared quantization 
error for this compressor when message power can vary from its reference leve 

fl for which the the quantizer is designed. „ 

For v{£) we solve (3.3-7) when the density P/(/) is defined at the reference 
power level f 1 ,. Easily solved integrals are involved and the result is [18] 

V [1 — exp(- V2//3v7;)i 

V(f) ~ 1 - exp(-\/2V/3v7|) ' 

The slope of v(f), by differentiation, becomes 

y'2V 2 /9f; ex p[- V 2/~/9/^] 

W(/) “ 1 - exp[-V2VV9^] 

On substitution of pff) for arbitrary message power f and w(f) into (3.3-5) and 
using (3.3-9), we obtain ej. Signal-to-quantization noise power then is found to be 



9 [1 - exp(- V2V 2 /9/^)l 2 [l - exp{-[V/j// 2 - (2/3)] V2V 2 //]}] 

This function is plotted in Fig. 3.3-2 for V l\ff\ = 10. 20> 40 - By selectin 8 

L = 128, for example, the best performance occurs for f = /, and is 

in dB = 10 log 10 (128 2 ) - 6.53 = 35.61 dB. 

It is clear from the curves of Fig. 3.3-2 that power mismatch limits 
the dynamic range over which ? can vary while maintaining minimum 
prescribed value of (SjN q ). Mismatch is most severe for / (f) > } M). 
Limited dynamic range is a major disadvantage of the optimum compandor. 

Logarithmic Companding 

Compressor characteristics can be found that differ from (3.3-7) and 
give less sensitivity to signal power mismatch than the optimum. Naturally 
there will be some loss in performance [smaller ( SJN q )], but the added 
dynamic range may make the loss acceptable if it is not too large. The n- 
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Figure 3.3-2. Performance of optimum quantizer for a message having Laplace 
probability density function. 


law compressor has such desirable properties; its characteristic is 

t r\ v / r\ hi[l + (p.|/l/V'XI i im 

v(f) = V sgn(/) ln(1 + M) — (3 - 310) 

where sgn(-) is the signum function,! ln(-) represents the natural logarithm, 
fi s= o is a constant, and V is the largest input signal magnitude that can 
be quantized without overload. Figure 3.3-3 shows the behavior of (3.3- 
10). In practice the value p = 255 is used in telephone systems in the 
United States [20]. 

If we again assume that the number of levels L is large and the 
message’s probability density is symmetric, we may find quantization error 
e 2 by use of (3.3-5). It can be written as 

4 = :£ f w-\f) P M) df, (3.3-11) 

jL -J - v 

where 


_ dv(f) p[l + U4f\ /v)]- 1 

df ln(l + p) 


t The signum function is defined by sgn (/) = 1 for /> 0 and - 1 for / < 0. 




76 


Baseband Digital Waveforms Chap. 3 



Sec. 3.3 Companded Quantizers 77 



Figure 3.3-4. Performance of /u-law compandor for p. = 255 and p. = 0 when 
the message has a Laplace probability density. The optimum compandor’s curve 
is shown for comparison. 


(3.3-15), when (3.3-16) is used. The case fi = 255 is shown, since this 
value and the Laplace density both apply to voice transmission. Also shown 
for comparison is the response of the optimum compandor of Example 
3.3-1 .t The /x- law compressor results in about a 4-dB loss in performance 
but clearly has an improved dynamic range! compared to either the optimum 
compandor or a system with no companding (/z = 0 curve). 

Another common logarithmic compressor used in Europe has a char- 


t The abscissa in Fig. 3.3-4 is obtained by subtracting 10 log 10 (V7/r) from the abscissa 
of Fig. 3.3-2. 

t Not sh own are the effects of amplitude overload, which become significant in the ft- 
law device for f 7 (t)/V* larger than about - 18 dB (for ti = 255 and L up to 256). 
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acteristic called A-law [21] given by 


v(f) = 


V(1 + In A) I -rr 


V(1 + In A)" 


(3.3-17) 


v(-f) = - v{f ), (3.3-18) 

where A ^ 1 is a constant. A = 1 corresponds to no companding. Curves 
of (3.3-17) appear similar in form to those of Fig. 3.3-3. A-law companding 
gives a slightly higher signal-to-quantization noise ratio (by tenths of a 
decibel) and a slightly flatter response within its dynamic range. However, 
its dynamic range is smaller than that of the /a-law system [22]. These 
comparisons assume A = 100 and g. = 255. 


3.4 SOURCE ENCODING OF DIGITAL MESSAGES 

The result of the sampling and quantization discussed earlier is a digital 
signal or message. It is digital in the sense that, for any given time, only 
a discrete number of symbols (output levels of the quantizer) are possible. 
Almost all the remaining parts of this chapter will deal with digital signal 
processing methods designed to work with these digital messages. Of course, 
these processors do not care if the digital message came from a sampler- 
quantizer or from a digital source directly. However, because of the way 
we have chosen to define a digital signal (waveform with a finite number 
of discrete levels) the first function of the processor is to convert each 
discrete level into a suitable digital representation. This function is called 
source encoding. 

Natural Binary Encoding 

The decimal (base 10) number system uses the digits 0, 1, . . . , 9. 
A binary (base 2) number system uses only two digits, 0 and 1, called 
binary digits. The decimal number 13 (a two-digit number) has the equivalent 
binary representation 1101, which we call a codeword. This codeword has 
four bitst (after binary digit), and all possible 4-bit codewords as a group 
form a binary code. An N b -bit code, or sequence of digits, b Ni . . . b^b 2 b x 
can represent any decimal integer number N from 0 to 2 Nh - 1 according 
to 

N = b Ni ( 2 Nt ~‘) + ••• + b 2 (2') + b,(2°). (3.4-1) 


t We adopt this name due to common usage. Do not confuse our use of bit with the 
unit of information. Some texts [23] use binit as the name of the message binary digits. We 
shall rely on context to resolve any conflicts that may arise in definitions. 
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The b n s are either 0 or 1. Here b, is the least significant bit (LSB) while 
b Nb is the most significant bit (MSB). 

A simple form of source encoding is to assign a binary code to represent 
the levels available from the source. When codeword digits are arranged 
in the sequence shown in (3.4-1), the code is a natural binary code. Figure 
3.4-1 illustrates the assignment of natural codewords to a digital source 
having not more than eight quantum levels. Actual levels may change every 
T, seconds, which is the sampling interval for an analog source or the 
symbol duration for a digital source. Clearly, an Af^-digit code has L b = 
2 Nb distinct codewords. When used with a source with L possible symbols 
(or levels), we require 

L L b = 2 Nb (3.4-2) 

for unique coding. 

The source levels obviously can represent negative as well as positive 
sample values. We simply label the levels decimally without regard to the 
polarity of the message sample. The first column of Table 3.4-1 illustrates 
a decimal assignment for a 16-level quantization in which half the levels 
are for positive and half for negative samples. Column 2 shows the cor- 
responding natural binary code. We take an example to illustrate the choice 
of L, L b , and N b . 



Decimal 

(Quantum) 

Level 

Assigned 

Natural 

Binary 

Codeword 


2 


010 


4 


100 


5 


101 


3 


011 


6 


110 


Figure 3.4-1. Natural binary source encoding with a 3-bit code. 
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TABLE 3.4-1. Various Binary and Gray Codewords for a 4-Bit Encoder. 


Decimal 

Level 

Number 

Natural 
Binary Code 

Folded 

Binary 

Code 

Inverted 

Folded 

Binary 

Gray Code 

Comments 

i 

!, 


gtgigigi 

15 

1111 

mi 

1000 

1000 

t 

i 

14 

1110 

1110 

1001 

1001 

Levels 

i 

13 

1101 

1101 

1010 

1011 

assigned 


12 

1100 

1100 

1011 

1010 

to 

i 

11 

1011 

1011 

1100 

1110 

positive 

i 

10 

1010 

1010 

1101 

1111 

message 


9 

1001 

1001 

1110 

1101 

samples 


8 

1000 

1000 

1111 

1100 

i 


7 

0111 

0000 

0111 

0100 

t 


6 

0110 

0001 

0110 

0101 

Levels 


5 

0101 

0010 

0101 

0111 

assigned 


4 

0100 

0011 

0100 

0110 

to 


3 

0011 

0100 

0011 

0010 

negative 


2 

0010 

0101 

0010 

0011 

message 


1 

0001 

0110 

0001 

0001 

samples 

i 

0 

0000 

0111 

0000 

0000 

1 




t 

I 






Sign bits for messages having 





positive and negative values 




Example 3.4-1 

A message has zero mean value and a maximum magnitude |/(i)| m „ = 10 V. It is 
to be quantized using a quantum step size of 0.1 V with one level equal to 0 V. 
We find the necessary values of L , L b , and N b . 

Since this signal always exists between - 10 and + 10 V, there are L = 
(20/0.1) + 1 = 201 levels required. The closest L b value is 256 from (3.4-2), 
requiring an N b = 8-bit code. The actual range of signal that could be handled is 
(256 - 1)0.1 = 25.5 V, corresponding to - 12.5 to 13.0 V, for example. There is 
then an amplitude margin of safety of 25%. 

Other codes are used [20] that are derived from the natural binary 
code. The folded binary code assigns the first (left) digit to sign and the 
remaining digits to code magnitude, as shown in the third column of Table 
3.4-1. This code is superior to the natural code in masking transmission 
errors when encoding speech. If only the amplitude digits of a folded binary 
code are complemented (Is changed to 0s and 0s to Is), an inverted folded 
binary code results (column 4); this code has the advantage of a higher 
density of ones for small-amplitude signals, which are most probable for 
voice messages. The higher density of ones relieves some system timing 
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errors but does lead to some increase in crosstalk in multiplexed systems. 
Advantages outweigh disadvantages, however, and the inverted folded code 
has been used in some telephone systems [20, p. 1684], 


Gray Encoding 

With natural binary encoding, a number of codeword digits can change 
even when a change of only one source level occurs. In Table 3.4-1, for 
example, a change from level 7 to level 8 entails every digit changing in 
the 4-bit illustrated code. In some applications this behavior is undesirable 
and a code is desired for which only one digit changes when any transition 
occurs between adjacent levels. The Gray code shown in Table 3.4-1 has 
this property if we consider extreme levels adjacent. 

Digits of the Gray code, denoted by g k , can be derived from those 
of the natural binary code by 


gk = 


b Nb , 

bk+ i © b k 


k = N b 
k<N h , 


(3.4-3) 


where ® represents modulo-2 addition of binary digits (0 © 0 - 0, 
0 © 1 = 1, I © 0 = 1, 1 © 1 = 0). 

The reverse behavior of the Gray code does not hold. That is, a 
change in any one code digit does not necessarily result in a change of 
only one code level. For example, a change in digit g 4 from 0 when the 
code is 0001 (level 1) to a 1 now corresponds to level 14 codeword 1001, 
a change spanning almost the full quantizer’s range. 


M - ary Encoding 

In some systems it is desirable to encode the L source levels to other 
than a binary base. For a code with N M digits with each digit having M 
possible levels, we require 

L^L m = M Nm (3.4-4) 

for unique encoding. We refer to this as an M-ary code. Table 3.4-2 
illustrates a 3-ary (ternary) code that has N M = 2 digits. In the special 
case where N M = 1, the code base equals the number of message levels. 

Source encoding amounts to mapping a source described by one code 
base to another. In all the digital sources discussed so far, the original 
source was equivalent to a one-digit, base-L codeword (decimal number 
representation of the L levels was used) and its codewords were mapped 
into another base (binary, ternary, M-ary). In every case each resulting 
codeword corresponded to a single source codeword (sample). There is 
another way in which M-ary encoding finds very practical use; it involves 
encoding a sequence of several source symbols (codewords) into one M- 
ary codeword. Consider a binary source ( L = 2) that generates one binary 
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TABLE 3.4-2. Three-ary Encoding of a 
Source with L * 9 Levels. 


Source Level Number 

3-ary Codeword 

s 

22 

7 

21 

6 

20 

5 

12 

4 

11 

3 

10 

2 

02 

1 

01 

0 

00 


digit each symbol interval. A sequence of N b such 1-bit binary words 
represents 2 Nb = L b distinct “words” that are possible during the N„ symbol 
intervals. This sequence can be encoded into a single M-ary code having 
N m digits if 

M Nu = L u s* L b = 2 Ni . (3-4-5) 

Note that with M = 2, N M = N b corresponds to mapping one binary code 
to another, such as converting a natural to a folded binary code. Of course, 
the above ideas are not restricted to binary sources. 

Source Information and Entropy 


Everyone has some intuitive idea of what information is. It will be 
necessary, however, to state a quantitative measure or definition of information 
for our future uses. A weather forecast for 12 days of cool, soaking rain 
for the Sahara desert would obviously carry more information than the 
usual forecast of clear, hot, and dry. Thus we would expect information 
to be larger for messages with small probabilities of occurrence. Furthermore, 
from intuition we would expect that the information contained in two separate 
and independent events should be the sum of the information m both events. 
Finally, we would expect information to be a positive quantity and be zero 
when an event is certain to occur. There is only one function satisfying 
all four of these conditions [23, p. 343]. It is the logarithm, and the chosen 
base determines the unit of information. Base 2 is commonly used in 
communication systems because it represents the smallest measure of choice. 

Thus we define the information I k contained in a source symbol having 
probability of occurrence P k as 


I k = - logjCFj = logzlpj. 


(3.4-6) 


where the unit is the bit. The average information of a source, called 


I 


i 
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source entropy, is obtained by averaging the information in all possible 
symbols. For L symbols this entropy, denoted by H, ist 

H = - i P k \og 2 (P k ) --7^2 (bits/symbol). (3.4-7) 

*-i lnuJ *-» 

Example 3.4-2 

A quantizer output has four levels that occur with probabilities 0.1, 0.3, 0.4, and 
0.2. We find the source entropy to be 

H= -[0.1 log 2 (0.1) + 0.3 log 2 (0.3) + 0.4 log 2 (0.4) + 0.2 log 2 (0.2)] 

= 1 .8464 bits/symbol. 

If the quantizer symbols had equal probabilities, entropy would be 

H = -4(i)log 2 (l) = 2.0 bits/symbol. 

Example 3.4-2 shows that the source with equal-probability output 
symbols has the largest entropy. This will be true in general [24]. 

Source entropy of (3.4-7) can also be applied to a sequence of symbols 
emitted over time. If the source is stationary t and its sequential symbols 
are statistically independent, the average information in a long string of 
symbols is again given by (3.4-7). Such a source is called a zero-memory 
source. 

Optimum Binary Source Encoding 

The entropy of a source, given by (3.4-7), is the smallest average 
number of binary digits that can be used per symbol to encode the source. 
Practical encoders typically require a larger average number of digits per 
symbol, which we denote by N. We define code efficiency by 



The binary codes discussed earlier all used a fixed number of bits (N b ) for 
each source symbol; for these codes N = N b and r) c „ ic « 1. with the 
equality holding only if source levels are equally probable and the number 
of source levels is a power of 2 (L = L b = 2 Nb ). The efficiency for the 
source of Example 3.4-2 is 1.8464/2.0 = 0.9232, or 92.32%, for example. 

If we abandon the requirement that codewords be of fixed length, it 
is possible to obtain more optimum codes that increase code efficiency. 
Huffman [25] has devised an encoding procedure that is optimum in the 
sense that it gives the smallest possible average number of binary digits 

t The second form of (3.4-7) is more convenient in computations than the first. Here 
ln( ) stands for the natural logarithm; ln(x) = log.(x). 

t A stationary source has symbol probabilities that do not change with time. 
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per source symbol subject to the constraint that the code be uniquely 
decodable. 

The Huffman procedure uses the following steps for binary encoding: 


1. Arrange the L source symbols in decreasing order of their probabilities. 

2. Combine the two symbols with smallest probabilities using a T junction 
to form a new symbol with probability equal to the sum of the two 
probabilities. Label the two input branches of the T with a 0 (lower 
branch) and a l.t 

3. Rearrange the original unused symbols and the new symbol in decreasing 
order of probabilities. 

4. Repeat Steps 2 and 3 until finally only one symbol remains with 
probability 1.0. 

5. The codeword for each source symbol is obtained by backtracing 
through the path generated in Steps 1-4 that leads to the symbol of 
interest. Each time a T is encountered, a new digit is added to the 
symbol’s codeword. 

The Huffman procedure is illustrated in Fig. 3.4-2 for an eight-symbol 
source. The number of source symbols does not have to be a power of 2, 
however. Observe that symbols with higher probability are assigned shorter 
codewords. The average word length becomes N = 2(0.35) + 2(0.23) + 
3(0.15) + 3(0.10) + 3(0.08) + 4(0.05) + 5(0.03) + 5(0.01) = 2.55 digits 
per source symbol. A fixed length code would require 3.0 digits per source 
symbol. From (3.4-7) this source has entropy 2.4863 bits per source symbol, 
so the Huffman code efficiency is 2.4863/2.55 = 0.975, whereas the 3-bit 
code gives only 2.4863/3.0 = 0.8288. 

The Huffman procedure can also be applied to M- ary encoding [17]. 
Basically the preceding procedure is used except with T junctions having 
M input branches labeled 0, 1, . . ., M — 1. For arbitrary L the last T 
will have M input branches only if L = M + n(M — 1), where n is an 
integer. If this relationship is not satisfied, dummy source symbols, all 
with zero probability, are added so that the last T has M inputs. The 
encoding procedure is then followed but the dummy symbols are discarded 
at the end. 

Source Extension 

When the number of source symbols is small and the symbol probabilities 
are significantly different, the optimum Huffman encoding procedure of the 
preceding subsection may still yield low code efficiencies. By using a 
technique called source extension, code efficiency can be increased. We 

t This labeling is arbitrary. We could also have assigned 0 to the upper branch. 



Figure 3.4-2. Huffman binary source coding of eight-symbol source. 
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discuss the technique under the assumption that source symbols are sta- 
tistically independent over time. 

By the preceding developments, an L-symbol source is encoded into 
a variable-length code using Huffman's procedure, which we now refer to 
as a first-order extension of the source. Clearly, nothing has really changed 
in first-order extension. However, in a second-order extension we consider 
two source symbols at a time for encoding. For L possible symbols per 
symbol interval and two intervals, there are L 2 possible two-element symbols. 
If all such two-symbol sequences are treated as a new source, this source 
has L 2 statistically independent symbols and its entropy is twice that of the 
original source. Huffman’s procedure can be applied to the new source to 
produce a binary encoding with efficiency typically larger than that of the 
original source. We consider an example of second-order extension. 

Example 3.4-3 

Consider a source of two symbols A and B having probabilities of occurrence of 
P(A) = 0.8 and P(B) = 0.2. Clearly, the Huffman procedure simply assigns a code 
1 to symbol A and code 0 to B. Entropy is readily found to be H = 0.7219 bits 
per symbol, so code efficiency is r) code = 0.7219. 

The second-order extension of the source is defined in Table 3.4-3. Pairs of 
source symbols are now encoded by the Huffman procedure as though they were 
single symbols from a new source. Source entropy of the new source is readily 
found to be H = 1.4439 bits per new symbol, so efficiency becomes = H/N = 
1.4439/1 .56 = 0.9255. This efficiency is 28.2% larger than that realized by encoding 
of the binary source on a per-symbol basis. 

To generalize the preceding procedure, we define the p-order extended 
source as the new source having L p symbols, which result from consideration 
of groups of p symbols from the L-symbol original source. Thus we encode 
sequences of p symbols from the L-symbol source. For large p, efficiency 
converges to unity. We illustrate these comments by developing the third- 
order extension of the binary source of Example 3.4-3. 


TABLE 3.4-3. Second-order Extension of a Binary Source with Symbol 
Probabilities P(A) = 0.8 and P(B) = 0.2. 


Second-order 
Extended Source 
Symbols 

Symbol 

Probability 

Huffman 

Codeword 

Probability • [Number of Code 
Symbols] 

AA 

0.64 

mm 

0.64 

AB 

0.16 


0.32 

BA 

0.16 

■ffh: 

0.48 

BB 

0.04 

H 

0.12 


N = 1.56 digits per symbol 
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Example 3.4-4 

The third-order extension of the binary source of Example 3.4-3 is summarized in 
Table 3.4-4. Here three source symbols at a time are Huffman-encoded to form 
an eight-symbol source of entropy H = 2.1658 bits per new symbol. Average 
codeword length is N = 2.184 binary digits per new codeword. Efficiency becomes 
T) CO de = 2.1658/2.184 = 0.9917, which is near the theoretical limit of 1.0. 


Other Encoding Methods 

All source encoding methods described earlier relate to zero-memory 
sources. When a source has memory so that output symbols in a sequence 
are not independent, source encoding methods can still be devised, usually 
based on Markov models (where any given symbol depends on at most a 
finite number of past symbols). These methods are considered to be beyond 
our scope. We shall, however, describe some special forms of source 
encoders for sources with memory. These descriptions are deferred to 
Chap. 4 because they are best handled as part of the discussion of an overall 
system. 


TABLE 3.4-4. Third-order Extension of a Binary Source with Symbol Probabilities 
P(A) = 0.8 and P(B) = 0.2. 


Third-order 
Extended Source 
Symbols 

Symbol 

Probability 

Huffman 

Codeword 

Probability • [Number of Code 
Symbols] 

AAA 

0.512 

1 

0.512 

A AB 

0.128 

0 1 1 

0.384 

ABA 

0.128 

0 1 0 

0.384 

BAA 

0.128 

0 0 1 

0.384 

ABB 

0.032 

0 0 0 1 1 

0.160 

BAB 

0.032 

0 0 0 1 0 

0.160 

BBA 

0.032 

0 0 0 0 1 

0.160 

BBB 

0.008 

0 0 0 0 0 

0.040 


N = 2.184 digits per symbol 


3.5 CHANNEL ENCODING FUNDAMENTALS 

The purpose of the channel encoder of Fig. 1.3-1 is to convert the source 
code to a form that will allow the receiver to reduce the number of errors 
that occur in its output due to channel noise. The theory surrounding 
channel encoding is vast and we shall make no effort to survey the topic. 
We only present a few of the more important and fundamental concepts 
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and techniques that are within our scope and that give at least a flavor for 
what channel encoding involves. 

The channel encoder adds redundancy to the source code by inserting 
extra code digits in a controlled manner so that the receiver can possibly 
detect and correct channel-caused errors. The resulting channel codes are 
usually binary. This is the only case we shall consider, although much 
theory has also been developed for nonbinary codes. The encoding process 
usually falls into two classes: block codes (often called group codes) and 
convolutional codes (also known as tree codes). 

Block codes correspond to subdividing the sequence of source (binary) 
digits into sequential blocks of k digits. Each &-digit block is mapped into 
an n-digit block of output digits, where n > k. The ratio k/n is called the 
code rate, or code efficiency. The difference 1 - (k/n) is called redundancy. 
The encoder is said to produce an (n, k) code. Thus a (7, 4) block code 
involves seven output digits per codeword for every four-digit input data 
codeword; the 7 - 4 = 3 extra digits are called check digits. Block codes 
are memoryless codes because each output codeword depends on only one 
source k-bit block and not on any preceding blocks of digits. Typical 
parameter values range from three to several hundred for k and from { to 
i for k/n [26, p. 9]. 

In contrast, convolutional codes involve memory implemented in the 
form of a binary shift register having K cascaded registers, each with k 
stages. The sequence of source digits is shifted into and along the overall 
register, k bits at a time. Appropriate taps from the various register stages 
are connected to n modulo-2 adders, as illustrated in Fig. 3.5-1. The output 
code becomes the sequence of n digits at the output of these adders generated 
once for every input shift of k source digits. The ratio k/n is still called 
the code rate, and K is called the constraint length [27]. Clearly, each n- 
bit output codeword depends on the most recent k source bits stored in 
the first (left) A:- stage shift register as well as K - 1 earlier blocks of k 
source bits that are stored in the other registers. A listing [27] of some of 
the better convolutional codes shows typical values of 2 « K 14, 1 
H4, and 2 =s n *£ 8 and rates from I to |. 

The main advantage of channel encoding is that system performance 
is improved relative to no coding. For two systems (coded and uncoded) 
that operate on the same channel (same noise level) and give the same rate 
of errors, performances are compared through the required transmitter 
energies required per bit. The ratio of required energies is called coding 
gain. Thus if the coded system requires half as much energy as the uncoded 
system, its coding gain is 2 (or 3 dB). For practical error rates channel 
encoding gives positive coding gains that range from less than 1 dB [26, 
p. 36] to well over 4 dB [26, pp. 248-249], depending on type of encoding 
and specific values of K, k, and n. 



Block Codas 

The simplest block code adds a single bit to the k data bits. The 
added bit, called a parity check bit, is chosen to make the total number 
of Is in the codeword always either even or odd. This simple coding 
procedure is capable of detecting only errors involving an odd number of 
bits. Errors in an even number of bits go undetected. In many systems 
it is unlikely that more than one error per word will occur; in these systems 
the simple parity check has value. 

The single parity check bit code has no error correction capability. 
By adding more check digits, the ability to correct for errors becomes 
possible. For correction of all patterns of t e or fewer errors and no others, 
the total number n of code digits is related to the number of data codeword 
digits k by [26, p. 6] 



where (") = n\/[i\(n - /')!] is the binomial coefficient. The equality is 
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achieved only for perfect codes ( Hamming codes discussed later are perfect 
codes). Most known codes are not perfect. For a A: = 3-digit data code 
(8 source symbols), for example, (3.5-1) indicates at least 3 check digits 
(n = 6) are necessary to correct only one error (t e = 1), which is a (6, 3) 
code. 

Block Codes for Single Error Correction 

Suppose we examine channel codes where the first k digits are made 
identical to the source (data) code and the r = n - k parity check digits 
are appended at the end. This digit arrangement is called a systematic 
code. All other linear (n, k) codes are equivalent to, and can be converted 
to, a systematic code [27, p. 245] by elementary operations, so our de- 
velopments apply to any linear (n, k ) code with suitable conversions. If 
the input binary codeword is written in the form b\b 2 . . . b k , where bj — 
0 or 1 for all 1 =£ j =s k, the output codeword has the form 


b l b 2 ...b k c i C 2 -..c r 


(3.5-2) 


where c ; is check digit i. 

In order for the check digits to be able to correct an error, they must 
be functionally related to digits bj. Let the c, be made linear functions of 
the bj as follows: 


C] = h u b\ © h u b 2 © © h u b k 

c 2 = h 2l b } © h 22 b 2 © "• © h 2 ibk 

c r = h rl b i © h r2 b 2 © •" © h rk b k . 


(3.5-3) 


Here the h tJ are constants (0 or 1) and products h u bj are defined as follows. 
0 • 0 = 0, 1 • I = 1, 0 • 1 = 0, 1 • 0 = 0. By adding the c,s to both sides 
of (3.5-3),t it can be written in the form 


[H][T] = [01, 

where [0] is an r x 1 column matrix of all binary zeros, 



~h u 

h\ 2 

.. h xk 

1 

0 . 

.. o' 

[H] = 

h 2 \ 

h 22 - 

.. h 2k 

0 

1 . 

.. 0 


J Vi 

h r2 . 

.. h rk 

0 

0 . 

.. 1 


(3.5-4) 


(3.5-5) 


is called the parity check matrix, and [T] is the output codeword (vector) 


t Clearly c, © c, = 0 because 0©0 = 0, 0 © 1 - 1, 1 © 0 - 1, and 1 © 1 0 

in modulo-2 addition. 
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given by 


(3.5-6) 


If [T] is the transmitted codeword, t the received codeword, denoted 
by [/?], can be different than [T] in only one digit. Then we can write 

[R] = [ T ] © [E], (3.5-7) 

where [£] is an error codeword having all digits 0 except the error digit, 
which is a 1. If there were no error, the product [//][/? ] = [H][T] would 
be [0] from (3.5-4). With errors, however, we have 

[S] = [i/][J? ] = [H][T] © [H][E] = [ H][E ], (3.5-8) 

where [5] is called the syndrome. Thus if [ E ] represents an error in the 
/th digit, the syndrome is just the ith column of [//]. This means that the 
receiver simply computes [5] = [//][/? ] and identifies which column of [H] 
has been reproduced in [5]; that column is the digit in error in [/?]. The 
erroneous digit is then corrected to form the final receiver output codeword. 


Example 3.5-1 


1 

1 

1 

0 

1 

0 

o' 

1 

1 

0 

1 

0 

1 

0 

1 

0 

1 

1 

0 

0 

1 


Let the input (data) codeword be 1110. We find the output (channel) codeword 
and then show that the syndrome identifies an error in position 5 of the channel 
codeword. 

From (3.5-4) we require 


T 

1 

1 

0 

1 

0 

o' 

1 

1 

0 

1 

0 

1 

0 

1 

0 

1 

1 

0 

0 

1 


t Many texts use row matrices as codeword representations. Either row or column 
definitions are valid as long as all equations are consistent. 
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1©1©1©0®c, = 0 
1©1©0©0©c 2 = 0 
1©0©1©0©c 3 = 0, 

which gives c, = 1, C 2 = 0, c 3 = 0. Hence [T]' = [1110100], where [•]' represents 

matnX F t o r r an an P e S rr e or in digit 5, we have [*]' = [1H0000] and 

111 


'1110 
[5] = [H][R1= 110 1 
10 11 


1 0 0 
0 1 0 
0 0 1 


Because [5] is the fifth column of [HI the error is in the fifth digit. After correction 
of the fifth digit, the correct word is d ecoded. 

The procedure used in Example 3.5-1 for finding the transmitted code- 
word [T] corresponding to an input (data) codeword defined y 

'bi~ 

[B] = b2 ( 3 - 5 ' 9 > 


can be formalized. By rewriting (3.5-9) and combining with (3.5-3) to get 
fi, = 1 £>, © 0 fi 2 © "• © 0 b k 
b 2 = 0 b x © 1 b 2 © ' ' ’ © 0 b k 


b k = 0i>i©0f>2©-©lfr* 
c, = fin fii © fin b 2 ® © b\ k b k 


(3.5-10) 


C r = fi, 

i b 

,©fi, 

•2 b 2 ® " 

• © h rk bk 


we readily have 


[T] = 

[G ]'[£], 


(3.5-11) 

where 

"1 

0 ... 

0 fin 

h 2i • • • h r i 


[G] = 

0 

1 ... 

0 fi,2 

h^7 • • • ^r2 

(3.5-12) 


.0 

0 ... 

1 fii* 

hi k ••• h rk _ 



is called the generator matrix. Our result (3.5-11) shows that the output 
codeword is given by the product of the transpose of the generator matrix 

and the input codeword. 
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Hamming Block Codas 

The design of codes consists, to a large extent, in finding suitable 
parity check matrices. (Clearly, finding [ H ] also gives [G] since the two 
are related from (3.5-5) and (3.5-12).) Hamming [28] has given a procedure 
for finding perfect block binary codest that have single error-correction 
capability. It follows from (3.5-1), with t e = 1, that 

2* = — (3.5-13) 
(1+n) 

for these codes. Equation (3.5-13) will have a solution only when n + 1 
is a power of 2, or 

n = 2 m - 1, m = 1,2,... . (3.5-14) 

By substituting (3.5-14) back into (3.5-13), we have 

0(2'”- 1) 

2* = . (3.5-15) 

2 m 

After equating the exponents, we obtain 

k = 2 m - 1 - m. (3.5-16) 

By combining (3.5-14) and (3.5-16), the allowable Hamming codes are defined 
by 

(n, k) = (2 m - 1, 2 m - 1 - m), m = 1, 2 (3.5-17) 

The first five nontrivial codes are (7, 4), (15, 11), (31, 26), (63, 57), and 
(127, 120). The efficiency (code rate) approaches unity for Hamming codes 
as m becomes large. 

The parity check matrix is readily defined for the Hamming codes. 
The n = 2 m — 1 columns of [H] consist of all possible (r = m)-digit binary 
words except the all-zero word. The resulting matrix [H] is not in systematic 
form but can be rearranged to be systematic by column permutations. We 
demonstrate these points by an example. 

Example 3.5-2 

For the (7, 4) Hamming code, the n = 7 columns become all r = 7 - 4 = three- 
digit binary words except the word having all zeros: 

0 0 0 1 1 1 f 

[H] = 0 1 10 0 1 1 . 

10 10 10 1. 

Column rearrangement puts [H] in systematic form 

0 11110 0' 

[/y] =101 1010. 

110 10 0 1 


t Recall that a perfect code is defined as one where the equality holds in (3.5-1). 
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Decoding of Block Codes 

Recovery of the original data binary sequence from the codeword 
produced by the receiver demodulator (Fig. 1.3-1) typically involves two 
techniques. First, the received codeword may be in error because of channel 
noise - these errors are corrected as well as possible using techniques such 
as the syndrome discussed earlier. Second, the corrected codeword is 
associated as well as possible with the corresponding original data word 
through knowledge of the code employed. 

If the error rate is low and corrected received codewords are exact, 
decoding is especially easy for systematic codes. The parity check digits 
may be discarded since the first k digits form the original data codeword. 

If the error rate is high enough that all errors cannot be corrected, 
or if the code has no error-correction capability, decoding can be accomplished 
by codeword comparisons. By comparing the received codeword to each 
of the possible codewords, the closest match, according to a suitable measure 
of closeness, is declared the proper codeword. One often-used measure 
of closeness is the Hamming distance. Two codewords, f?, and Rj , of equal 
length that differ in d u digit positions are said to have Hamming distance 
d .. Clearly, the smaller the Hamming distance, the more nearly two code- 
words are the same. When word length is large so that many codeword 
comparisons are required, this technique becomes less attractive. 

For additional details on decoding of block codes, as well as other 
codes not covered here, the reader is referred to some of the recent literature 
[26, 29-34]. 

Convolutional Code* — Tree Diagram* 

Three methods — tree diagrams, trellis diagrams, and state diagrams— 
may be used to describe a convolutional code. We briefly consider each 
of these methods through the description of an example encoder. In each 
case we assume the encoder shown in Fig. 3.5-2(a). It consists of a single 
(k = 1) shift register with K = 3 stages suitably connected to n - 2 
modulo-2 adders. The contents of the adders are sampled to provide a 
sequence of two output binary digits for each input binary digit in a sequence 
of input digits. As shown, shift register stages 5, and S 3 are modulo-2 
added to produce the first digit in the output sequence. Stages S„ S 2 , and 
S 3 are all modulo-2 added to obtain the second digit. 

For example, let us find the encoder’s output for an input sequence 
of 101 . Initially assume the shift register contains all Os (it is said to be 
cleared or reset!). Output is begun after the first digit, 1, arrives. The 
shift register now contains 100 (left to right) and the adder outputs become 
100=1 (adder 1) and 1 © 0 © 0 = 1 (adder 2). The output sequence 

t Actually, the rightmost digit is arbitrary because it shifts out of the register and is 
lost anyway as soon as the first data digit occurs. 



Bit 1 ' ' ' Bit 2 Bit 3 Bit 4 Bit 5 

(b) 


Figure 3.5-2. Tree diagram (b) for convolutional encoder of (a). 



„ Baseband Digital Waveforms Chap. 3 


11 is generated before the second input digit arrives. After the second 
digit, 0, arrives, the register contains 010; the adders contain 0 © 0 - 0 
(adder 1) and 0 © 1 © 0 = 1; the output sequence is 01. The third input 
digit 1 produces 101 in the shift register, 1 © 1 = 0 in adder 1, 1 © 0 © 

1 = O in adder 2, and an output sequence 00. Thus the input sequence 

101 has generated the sequence 110100. 

A tree diagram is a means of illustrating the preceding sequence ot 
events as well as all other possible sequences that may occur in the encoder. 
Figure 3.5-2(b) shows the tree for the example encoder of (a). Tree operation 
is as follows: A given sequence of input digits causes a particular path to 
be followed through the tree from left to right. Each new digit corresponds 
to branching either upward (for a 0) or downward (for a 1) at a node; the 
chosen branch leads to the next node in the tree path. The shift register s 
contents (in the boxes) and the output sequence generated are shown for 
each branch. For example, the first two digits 10 of the sequence of the 
previous paragraph lead to the first node marked c. 

Figure 3.5-2(b) shows that the tree diagram is repetitive after the third 
branching. Only four distinct branch points (nodes) result; they are marked 
a b c and d\ these are called the states of the encoder. Every time a 
tree ’path arrives at node (state) b, for example, the upper branch always 
corresponds to an output 01 and the lower branch generates 10. Some 
reflection by the reader will show that the branch chosen at any given state 
node is determined by the most recent input digit, whereas the type of state 
node ( a , b, c, or d) is determined by the two previous digits. Amval at 
state a corresponds to previous input digits 00, b to 01, c to 10, and a to 11.' T 
In our example tree diagram, only four states occur and only two 
branches emanate from each node. More generally, an encoder of rate k/n 
and constraint length K will have 2* branches emanating from each node 
in the tree and 2 W_,) states [27]. (Why?) 


Convolutional Code* — Trellis Diagrams 

The number of branches in a code tree doubles each time a new input 
digit occurs. For a long sequence of input digits to be encoded, the usefulness 
of the tree diagram is limited. A better approach uses the trellis diagram, 
shown in Fig. 3.5-3(a) for the encoder of Fig. 3.5-2(a). A trellis cames the 
same information as a tree but makes use of the fact that the tree is periodic 
(in the steady state condition) and involves only a finite number of states. 

Use of the trellis is similar to that of a tree. Again each node in Fig. 
3.5-3(a), for our example encoder (K = 3, k = 1, n = 2), has two branches; 


t Branches in the tree occurring prior to the first state nodes correspond to a transient 
interval in the encoder. Branching from state nodes to other state nodes corresponds to a 
steady-state condition. 
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110 1 

' v — ' 

Input Sequence 


(b) 

Figure 3.5-3. (a) Trellis diagram for the encoder of Figure 3.5-2(a). (b) Trellis 
path for input digit sequence 1101. 


upper (shown solid) and lower branches (shown dashed) correspond to input 
digits of 0 and I, respectively. After the first two input digits, the encoder 
will be in one of its four states a, b, c, or d, represented by open dots. 
Thereafter each new input digit will only cause transitions to occur from 
one state to another and the trellis becomes repetitive. 


Example 3.5-3 

The input sequence to the convolutional encoder of Fig. 3.5-2(a) is 1101 when the 
shift register is initially cleared. We find the output sequence of digits and the 
trellis path traced by this sequence. From the trellis of Fig. 3.5-3(a), the output 
sequence is 11101000. The trellis path is shown in Fig. 3.5-3(b). 
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The general convolutional encoder of rate k/n and constraint length 
K will have 2* branches input to each state node in its trellis and 2 branches 
leaving each state node (after thefirst which terminates the transient interval). 
The number of possible states is 2 [27, p. -86]. 

Convolutional Codes— State Diagrams 

From our preceding discussions it is clear that once a st ® a ^V sta ^ 
condition is reached in a convolutional encoder, each new input bit causes 
aTansition from on. s.at. to anchor. These r s can Jo h 

described by a state diagram, as shown in Fig. 3.5-4 for our example 
encoder of Fig 3 5-2(a). Paths within the diagram correspond to transitions 
ES node to another caused by a current 
dashed paths for 1). For example, if the encoder is “ 
currently arriving bit is a 1, it causes a transition to state b (dashed line) 
corresponding to a generated output of 11. (See the tree of Fig. 3. • ( . )•) 

If the bit were a 0 instead, the only other possible transition (from the tree 
diagram) is to state a (back to itself), as shown by the solid line, the 
generated output is now 00. Transitions from other states are described in 

a similar way. 

Vitarbi Decoding of Convolutional Codes 

In the receiver, the demodulator will estimate what sequence of binary 
dieits is being received over the channel. Occasionally the demodulator 
will decide that a 0 was received when a 1 was actually received, and vice 
versa In other words, the demodulator’s output codewords will occasionally 
rrmtain erroneous digits. The purpose of the channel decoder is to accept 
the erroneous sequence of demodulator output digits and produce the most 



Figure 3.5-4. State diagram for the encoder of Figure 3.5-2(a). 
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accurate replica possible of the source sequence that was input to the 
channel encoder back at the transmitter. 

For convolutional codes, the optimum decoding process amounts to 
finding the single path through the code tree or trellis that most nearly 
represents the demodulated digit sequence. Obviously the transmitted code 
digits correspond to a specific path through the tree or trellis. However, 
the receiver has no knowledge of the exact path and it can only use the 
received sequence, which possibly has errors, to find the most likely path 
that corresponds to the received sequence. This most likely path is then 
used to specify the decoded (data) sequence that would have generated the 
path. This procedure is called maximum-likelihood decoding, t 

The Viterbi algorithm [35] is a maximum-likelihood decoding procedure 
based on finding the tree or trellis path with the smallest distance between 
its digit sequence and the received sequence. The Hamming distance between 
two codewords of the same length has been defined as the number of digits 
that differ in the two sequences. For example, the sequence 011010111 
differs from the sequence 111001101 in digits 1, 5, 6, and 8, so the Hamming 
distance is 4. 

The Viterbi algorithm is best described through an example using the 
code’s trellis. Again, we shall consider the encoder of Fig. 3.5-2(a) having 
the trellis of Fig. 3.5-3(a). Now suppose we wish to send the source encoded 
sequence 01101 to the receiver. For purposes that will be more obvious 
as we progress, we append two zeros to the end of the sequence, called 
the tail ; the source encoded sequence becomes 0110100. From the code 
trellis the channel encoded sequence is readily found to be 00111010000111. 
These sequences are shown in Fig. 3.5-5(a). Next, assume digits 4 and 10 
are erroneously estimated by the receiver so its demodulator output sequence 
is 00101010010111 as shown in (b). The channel decoder must work with 
only this sequence and knowledge of the trellis applicable to the channel 
encoder. 

We now describe the Viterbi procedure. Define the trellis’ branch 
levels as 1, 2, 3, ... , starting with the root node as branch level 1. Paths 
between any two branch levels in the trellis correspond to one source 
encoded digit and two channel digits in our example. The first step in our 
procedure is to draw the trellis paths up to the second level at which state 
nodes occur (level 4 in our example). There will be eight distinct paths, 
as shown in Fig. 3.5-6(a). The Hamming distances are computed between 
the first six received digits (00 10 10) and the sequences corresponding to 
the eight paths. Two paths merge at each of the two state nodes at level 
4. Because the succeeding path must be the same for two paths that merge 
at a state node, we compare the two Hamming distances of the two merging 

t We describe hard-decision decoding. There is also a procedure called soft-decision 
decoding that can further improve performance [31, p. 371]. 
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Survivor qq h 10 10 00 01 11 

Path Word 

Decoded 0 1 1 0 1 0 0 

Data Word 

Figure 3.5-5. (a) Transmitter sequences and (b) receiver sequences applicable 
to Viterbi decoder that gives the survivor path of (c) for the channel encoder of 
Figure 3.5-2(a). 

paths at each node and select the one with smallest distance; it is called 
the survivor. The survivor paths are shown by solid hnes in Fig. 3.3-b(a), 
and discarded paths are shown as dashed lines. Hamming distances are 
shown for each path with the survivor’s distance circled 

The next step is to extend the survivors at branch level 4 to branch 
level 5, using the eight possible trellis paths and recompute the Hamming 
distances for the eight paths relative to the received sequence (00 10 10 
10) Again we determine the four survivors at branch level 5 nodes, as 
shown in Fig. 3.5-6(b). Here there are ties at nodes a and fc. In c ^ of 
ties we make a random decision and choose the upper of the two paths. 
This step is repeated until all input regular 2-bit channel sequences are 
considered (one additional level to branch level 6 in our example). There 
are always only four survivors at each branch level. 

To finally make a choice of one surviving path of the four survivor 
paths, the two tail digits 00 are used. Since the receiver knows that the 
next two branch intervals correspond to tail 0 digits, t only those paths in 
the trellis are examined. From the trellis, the branches emanating from 


t The receiver knows the tail arrival time through synchronization. 



Figure 3.5-6. Survivor paths applicable to the text’s example of the Viterbi 
decoding algorithm. 
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nodes a and c will merge at node a for the first tail branch interval, whereas 
Sals from i and d wUl merge at node c. Thus only two survivors exist 
(one at node a and one at node c) at the end of the first tail dig't s branch 
interval Finally, the second tail digit 0 will force the paths emanating from 
nodes a and c to terminate at node a (at branch level 8 ).t Figure 3.5-6(c) 
shows Ae final paths in the tail intervals and all preceding survivor paths^ 
Thefinal survivor path is shown in Fig. 3.5*0 along with the data sequence 
to the path. Observe .ha. both channel errors have been 

corrected by the example code. The 

The above example used only a 5-digit sequence of data digits. The 
sequence was termini by the tail digits 00, which also reset the decoder 
shift register for another data sequence. Any length data sequence can b 
Sed but the decoder memory required to store the successive survivor 
paths becomes large. The delay involved in obtaining the final survivor is 
Iso too long for most practical applications. One approach to solving these 
problems is to determine survivor paths using only a finite (smal l number 
of preceding branch intervals. In effect, the oldest parts of the rell.s 
survivor paths are discarded. It has been found by computer simulation 
that the loss relative to optimum Viterbi decoding is negligible if the num er 
of retained branch intervals is not less than 5 K [27, p. 2V8J. 

The more general Viterbi decoder for a rate k/n code of constraint 
length K will have 2 W_1) states in its trellis. Each state will have 2 paths 
S™ .. Slate node aud 2* paths emattatitte from these nodes (m 
steady 8 state). Thus the number of required computations at each branch 
interval increases exponentially with k and K For this reason the 
algorithm is typically used only with small values of k and K. 


3.6 WAVEFORM FORMATTING OF DIGITAL SIGNALS 

The preceding discussions have centered mainly on the A/D conversion 
source encoding, and channel encoding concepts gjta* * 

suitable sequence of digital symbols for input to the modulator of F «- 1 - 3 • 
! ;rs attention is focused on the modulator, whtch eonvot .s . he 
binary symbols into a suitable waveform for transmission over the channel, 
w" eonsideV only baseband waveforms. These waveforms are typ, call, 
seauences of pulses generated according to the input digital code. 

We may think of the process of waveform seiection as a/ormam^ 
of the digital code. There are many formats (waveforms) that have been 
developed. We shall discuss only those that are considered the most bas.c 
and widely used. In all cases we assume rectangular pulses for clanty 


t Recall that two successive data zeros reset the shift register, which is equivalent to 
forcing all trellis paths to converge at node a. 
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illustration, even though practical systems often purposely use nonrectangular 
signals. The more practical waveforms are considered further in Chap. 4. 

Unipolar Waveform 

A unipolar waveform transmits a pulse for a code 1 and no pulse for 
a 0. For a symbol interval of duration T k and a channel encoded digital 
sequence as illustrated in Fig. 3.6-l(a), the unipolar waveform is shown in 

Channel Encoded Sequence { a k } 

1100010111001 0 



— 1 I— 


(b) Unipolar 





(e) Manchester 



Figure 3.6-1. Waveform formats for the digital sequence of (a): (b) Unipolar, 
(c) polar, (d) bipolar, (e) Manchester, and (f) differential formats. 
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(b) The pulse illustrated is said to be a nonreturn-to-zero (NRZ) type 
because it occupies the full symbol interval’s duration, T„. Use of a «ower 
pulse, typically of duration TJ 2, corresponds to a return-to-zero (RZ) type. 
The RZ type waveform requires more bandwidth than the NRZ type. Clear y, 
the format can be changed to use negative instead of positive pulses if 

One of the main advantages of the unipolar format is its simplicity. 
It also interfaces well with digital logic. Disadvantages are that it does not 
perform as well with noise as some other formats and is not as easy to 

synchronize in the receiver. t , , - 

For use in Sec. 3.7 to follow, it is desirable to formulate a model for 

the generation of the unipolar pulse stream. If we let a k ,k = 0, ± 1, ± - 
represent the channel encoded sequence of binary digits (a* - 0 or 
a k = 1, all k), the transmitted unipolar waveform (NRZ type) can be written 


°° t 

s(t) = 2 rect ~ 

* = -oo 


- (TJ2) - kT„ 
T b 


(3.6-1) 


where a* represents the amplitudes of pulses with a k equal to A or 0 f 
a, = l or 0, respectively. We draw on the sampling models of Chap. 2 
and observe that (3.6-1) can be generated by the network of Fig. 3.6-2 if 
the network is defined by 

. T t - (Tb/ 2)1 n 

p(t) = rect l f b J U ’ 


(3.6-3) 


P(o>) = T,Sa ^ e 


This model is useful in calculating the spectral properties of s(t) m the 
following section, but it does not necessarily reflect a practical way of 
generating the channel’s waveform. (Think about why this fact is true.) 

To demodulate the waveform of (3.6-1), the receiver can sample he 
received waveform once during each symbol interval If it senses leve A 
is present, a 1 is declared the code symbol received. If level zero (no 
pulse) is sensed, a 0 is declared. 

Polar Waveform 

Figure 3. 6- 1(c) illustrates the polar signal format. Here a positive 
pulse is generated for each code 1 and a negative pulse is used for a 0. 
Reversal of polarities can also be used. The NRZ type format is shown 


t A Dulse train generated by the receiver's clock, must be synchronized with the 
received bit stream so that the receiver will know, in time, when the b.t (or symbol) intervals 


occur. 
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a(r)= E a k S(.t - kT b ) ► pit), P(u) *-s(r)= Z ot k p(t - kT b ) 

k “ k « 

Figure 3.6-2. Model for the generation of a formatted waveform. Amplitudes, 
a k , of the impulses are related to the binary sequence of channel encoder output 
digits. 

but an RZ type is also possible for pulses of duration less than the symbol 
(digit) interval T b (pulse duration typically T h /2). The polar NRZ signal 
performs better in noise than the polar RZ waveform (for equal pulse 
amplitudes and values of T b ). However, the latter can more readily be used 
in the receiver to provide synchronization because it always has two level 
transitions per symbol interval. 

The model of Fig. 3.6-2 applies to both polar NRZ and polar RZ 
formats if sequence coefficients a k now have values +A (for 1) and -A 
(forO). Equations (3. 6- 1 )— (3 . 6-3) apply to the NRZ waveform. For the RZ 


f ~ (TJ 4) - 
TJ 2 

- (TJ 4)1 


s(t) = ^ a k rect 


p(t) = rect 


P( u) = ~ Saf^V^ 4 
2 4 / 


(3.6-4) 


(3.6-5) 


(3.6-6) 


if pulses have duration T b /2. 

To demodulate the polar waveform, the receiver can sample the signal 
once each symbol interval. Sample amplitudes of A or -A correspond to 
binary digits 1 and 0, respectively. Of course, our comments assume noise 
is absent. Noise effects are discussed later (Chaps. 4 and 5). 


Bipolar Waveform 


In the bipolar format, code 0s are transmitted by no pulse and Is are 
sent as pulses of alternating polarity, as shown in Fig. 3.6-l(d) for NRZ 
type. RZ can also be implemented, usually with pulse durations TJ2. A 
main advantage of the bipolar format is that it contains no dc component, 
even if long strings of 0s or Is occur (this is not true for unipolar and polar 
waveforms). Receiver circuitry can be ac coupled and dc drifts become 
less of a problem. 

The reader can readily verify that the model of Fig. 3.6-2 applies to 
bipolar formats (NRZ and RZ) if the coefficients a k now have three values 
+A, 0, and —A. Because three levels are used, bipolar modulation is 
sometimes called pseudoternary [36]. Another name that has been used is 
alternate mark inversion (AMI) [37], 
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Although receiver timing can be recovered from the bipolar waveform, 
a long string of Os can cause difficulties [18, p. 476], Special sequences of 
pulses can be filled into the no pulse sequence caused by the string of Os. 
The resulting waveforms are called high-density bipolar. The added sequences 
are specially designed to violate the bipolar rule so that they can be detected 
by the receiver. Additional details on these waveforms are given m Spilker 
[18] and Shanmugam [38]. 

Receiver demodulation of the bipolar waveform involves examining 
the amplitudes of samples taken once per symbol interval. For large enough 
sample magnitude, a 1 is decided. If the sample’s magnitude is small 
enough, a 0 is decided. By examining the sign sequence of samples, the 
receiver can also gain knowledge of errors since an alternating sign behavior 
should occur for demodulated Is. 

Manchester Waveform 

The Manchester waveform transmits a 1 as a positive pulse for half 
the symbol interval, followed by a negative pulse for the remainder of the 
interval; a 0 is conveyed by the same two-pulse sequence but of opposite 
polarity. An example waveform is shown in Fig. 3.6-l(e). Other names in 
use for Manchester coding are split-phase, twinned-binary , bi-phase, an 
one-out-of-two. It is also called bi-phase-level [39, p. 11] to distinguish it 
from other waveforms called bi-phase-mark and bi-phase-space. 

The bi-phase-mark signal has a level transition at the start of every 
symbol interval. A 1 causes a second transition one-half symbol period 
later, but a 0 causes no second transition. The opposite behavior occurs 
with bi-phase-space; it also has transitions at the start of each interval, bu 
the second transitions one-half interval later are due to Os, whereas Is cause 

no second transitions. „ . 

The model of Fig. 3.6-2 applies to Manchester coding if the coefficients 

a k are A or -A (as in polar-NRZ), but the network is now defined by 

Jt~(T b /A) 1 \ t - (37y 4)l rt.fi.-n 

p(t) = rect ■ Tb/2 — rect [ T j2 


P{u>) = j T b Sa 


(3.6-8) 


Practical generation of the Manchester waveform results from first 
generation of a polar-NRZ signal and then multiplication by a synchronized 
square-wave clock (amplitudes ±1) having a period T b . 

Because at least one level transition occurs every symbol interval m 
the Manchester signal, receiver clock timing can be extracted from the 
waveform, even in the presence of long strings of Os or Is A disadvantage 
is that it requires twice the bandwidth of a bipolar signal. 
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At least two ways exist for a receiver to demodulate a Manchester 
waveform. In the first, two samples can be taken each symbol interval; a 
1 is declared if samples in a given interval have sign sequence + - , whereas 
a 0 is decided for signs - +. A second approach multiplies the input 
waveform by a synchronized square-wave clock ( + 1 , - 1 amplitudes) of 
period T b . The product will be a polar-NRZ waveform that is then de- 
modulated as previously described. 

Differential Waveform 

The differential waveform is actually the result more of a coding 
technique than it is a format. The waveform often uses two levels A and 
-A, as shown in Fig. 3.6-l(f) but unipolar versions (levels A and 0) are 
also possible. Occurrence of a digit 1 causes a level transition at the start 
of its interval, but a 0 causes no transition. Thus a symbol interval has a 
level different from that of the previous interval only if it corresponds to 
a code 1. This waveform has also been called NRZ-mark [39]. 

A differential waveform, where level transitions occur for Os and no 
transitions for Is, can also be constructed. These are sometimes called 
NRZ-space [39]. 

Waveform generation involves first differentially encoding the given 
binary sequence {b k } to obtain a new sequence {a k } that then becomes the 
code for generating a polar waveform. Figure 3.6-3(a) shows the necessary 
encoder. Prior to the start of the input sequence the output is set to an 
initial value (0 or 1) that is arbitrary. Thereafter, 

a k = b k ®a k . ,. (3.6-9) 

Example sequences are shown in (b) for an initial value 1. 

In the receiver, demodulation is the same as for a polar-NRZ signal 
except now the demodulated sequence, denoted by {aj, must be decoded 
to obtain the receiver’s estimated sequence, denoted by {b k }, corresponding 
to the original sequence {b k }. The decoder is shown in Fig. 3.6-3(c); its 
operation is defined by 

b k = d k @a k . (3.6-10) 
The reader may wish to use the sequence of (b) to verify that (3.6-10) gives 
the correct response code when no channel errors occur due to noise. 

A principal advantage of differential encoding is that the transmitted 
waveform can be inverted without affecting decoding accuracy. This property 
is important when the polar waveform passes through a cascade of networks 
for which the output signal’s polarity may be unknown. 

Duobinary Waveform 

The duobinary waveform is one that again uses a pulse to represent 
a 1 and a negative pulse to represent a 0. However, now the pulses differ 
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Modulo-2 



Figure 3.6-3. (a) Differential encoder, (b) example encoder sequences, and 
(c) decoder. 


from all the waveforms discussed previously in that their durations are two 
symbol intwvals rather than one. Thus a code 1 in a given interval correspond 
to a Dulse occupying the given interval and the following interval. Som 
reflection shows that, after an initial transient inter** ft* t™*^™ 

consists of either positive or negative pulses in some intervals as wen as 
some intervals with no pulses. If the pulses have durations 2T, and amphtudes 
A/2 (for 1) or - A/2 (for 0), the steady-state waveform pulses have amplitudes 
±A ( An example channel encoder sequence is shown m Fig. 3.6-4(a), and 
the corresponding duobinary signal is sketched in (b). The w ^°™ s 
(c) and (d) are discussed later and are included only for companson purpos 

at th '^PO'" t a dvantage of the duobinary waveform is its capability of 
transmuting a Vven sequence with only half the bandwidth o unipolar and 
polar formats (because the basic pulse has duration 2 T b instead ot »). 
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Channel Encoder Sequence { a k } 

110001011 10010 

(a) 



(c) Modified Duobinary 



(d) Miller 

Figure 3.6-4. Waveforms for the digital sequence of (a): (b) Duobinary, (c) modi- 
fied duobinary, and (d) Miller. 


Alternatively, for the same bandwidth, the rate of symbols transmitted can 
be doubled with the duobinary system [40-44]. 

The model of Fig. 3.6-2 can be extended to the duobinary waveform, 
as shown in Fig. 3.6-5. In the basic method of (a), an impulse is generated 
in interval k with amplitude a k = A/2 or -A/2, according to whether the 
binary digit in interval k is 1 or 0, respectively. Each impulse excites the 
filter with transfer function P{<o) to generate a pulse of duration T b . A 
delayed impulse also generates a second pulse of duration T b delayed by 
T b relative to the first. The net effect is the generation of a pulse of duration 
2T b for each binary digit. The reader can readily show that the equivalent 
system of (b) is valid. For rectangular pulses we require 

Jt-{T b /2)~ 

p(t) = rect 

1 b 


(3.6-11) 





For demodulation of the wa>je ; x 
of (a), we have dii l, j" 

Original sequence 
I i <: 

Samples 


Recovered secue-c 

i )t I!.. 

Note that digit 1 (a 1) is rerav. 
to be a 1. This result 
same. If the first two ‘■V, e c, 
two original digits are ifjej^sCjS 

i7‘ " ~ 

The receiver’s demcj&j 
on the previous intervT/v ; 
propagate. However,,' 
differentially encoded 1 , a^' X 
to generate the duob^apW; 
The procedure is called pifl 


t Our simplified discussio; 

llljfflj; 
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Duobinary 

Waveform 



Duobinary 

Waveform 


' k diagrams for generation of duobinary waveforms. 
' ivalent system. 


I = T b Sal 


( 3 . 6 - 12 ) 


follows proper interpretation of samples 

'' interval. The rule ist as follows: If the sample 

declared a 1 (the preceding digit will also be a 

he digit is declared a 0 (the preceding digit is 

,ro the digit is detected as a 1 if the preceding 

I tected as a 0 if the preceding decision was 1. 

a(0 = £ kTb) ~~~~ 

* - -- i 

norm of Fig. 3.64(b) that corresponds to the sequence 

b ", 

1 1 0 0 0 1 0 1 I 1 o 0 1 0 

• AO -A -A000AA0 -A 00 
“ e 1100 0101110010 

_ iered since a sample A means the previous digit had 
_ kTh) - ssible because the first two original digits were the 
aU) ' * --- ‘ "arent, the recovered sequence is indeterminate until 

ul. (Why?) 

Figure 3.6-5. Functional filiation process for a given symbol interval depends 
(a) Basic method and (wfe, general. This fact means that errors tend to 
'' irhe channel encoder’s output sequence is first 
described earlier, and the new sequence is used 
j, V ' waveform, error propagation can be eliminated. 

In the receiver, ““ a 

taken once in each symbof f 
amplitude is A, the dig.* f S c 
1); if the amplitude is -Ak 
also a 0); if the sample is zj& 
digit decision was 0, anc' Jp 


Cvamnln 3.6-1 
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Now the receiver decides a 0 is being received in any given symbol interval 
if that interval’s sample amplitude is either A or - A; it decides a 1 is pres- 
ent if the sample amplitude is zero. No reference to previous intervals is 
required. 

The preceding duobinary waveform generation procedure using pre- 
coding can be stated in two other equivalent procedures. In the first procedure, 
no pulse is used in a symbol interval when the binary sequence symbol is 
a 1; pulses are used for 0s with a given pulse’s polarity reversed from that 
of the previous pulse if an odd number of Is (no pulse intervals) falls 
between. An even number of Is results in no sign inversion. In the second 
procedure the complement of the data sequence is precoded. No pulse is 
used for intervals with 0s; pulses with sign reversal correspond to Is with 
an odd number of 0s between and no reversal in sign occurs for an even 
number of 0s [40]. The preceding detection procedure for precoding now 
gives the complement of the original data sequence. The reader is encouraged 
to verify these procedures using the sequence of Fig. 3.6-4(a). 

Modified Duobinary Waveform 

If a duobinary waveform is delayed one symbol interval (by T b ) and 
subtracted from the undelayed waveform, the result is a modified duobinary 
waveform. The signal retains the bandwidth reduction relative to other 
formats but has the additional advantage that its power spectrum has no 
dc component [43]. 

The difference of two duobinary waveforms is equivalent to generating 
a signal using the basic waveform of Fig. 3.6-6(a) for each symbol 1 in the 
binary sequence and its negative for each 0. With this basic waveform, 
the models of (b) and (c) are readily shown to apply, again with a k = All 
for Is and a k = -A/2 for 0s. From the behavior of p md {t), it is clear that 
any given digital symbol will be correlated with symbols two intervals later, 
which means that any receiver symbol is related to the symbol occurring 
two intervals earlier in time. An example waveform is shown in Fig. 
3.6-4(c) for the symbol sequence of (a). The initial transient interval is now 
two intervals in duration. The filter impulse response p(t) and transfer 
function P{<a) are again given by (3.6-11) and (3.6-12). 

In receiver decoding (no noise assumed), samples in each symbol 
interval indicate symbols 0 and 1 if the sample values are -A and +A, 
respectively. If the sample value is zero, the symbol is the same as occurred 
two symbols earlier. If an error occurs when noise is present, it tends to 
propagate and affect later symbols. 

As in duobinary modulation, error propagation can be eliminated by 
precoding. The coder of Fig. 3.6-3(a) again applies, except the delay is 2T b 
{two symbol intervals). Thus 

a k = b k ® a k - 2 - 


(3.6-13) 
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2T b 

3 T„ 

n 

L 

J ' 


a(f ) * 2 a t 5(/ - kT b ) 

k = -•* 



Modified 

Duobinary 

Waveform 


a (/)= 2 ot k 6(t - kT b ) - 

k - — 


2// > (cu) sin(wr 6 )e 


Modified 
- Duobinary 
Waveform 


Figure 3.6-6. (a) Waveforms used in generating a modified duobmary signal 
with rectangular pulses, (b) block diagram of model system to generate the 
■waveform, and (c) a system equivalent to that of (b). 

In the receiver, the sample in any interval is decoded as a binary 0 if its 
value is zero (no noise case) and as a 1 if its value is either A or -A. 

Millar Waveform 

The Miller waveform results when a level transition occurs at the 
midpoint of each symbol interval corresponding to a 1 m the binary sequence. 
No transition occurs for a 0 unless followed by a 0, in which case a transiUon 
occurs at the end of the interval. An example waveform is shown m Fig. 

3 6-4(d). Miller coding is also called delay modulation 139J. 

A Miller waveform is attractive for magnetic recording and in some 
phase-shift keyed signals because its power spectrum is | smal 
near dc and its bandwidth requirements are approximately half those needed 

by Manchester coding [39]. 

M-ary Waveform 

Our final format example is the M-ary waveform. We briefly considered 
M - ary codes in Sec. 3.4. If the input to the modulator is a sequence of 
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M-ary digits, one digit per signaling interval of duration T b , a logical extension 
of the binary formats is to simply assign a pulse of duration T b to each 
digit that can have any one of M amplitudes. If all amplitudes are nonnegative, 
we have a unipolar format. In a polar format, pulses would be paired so 
that there is a negative pulse having a magnitude equal to one of the positive 
pulses. t If M is a power of 2, such as M = 2 Nb , and each of the M 
amplitudes is equally probable, the information conveyed in each symbol 
interval (assumed independent interval to interval) is / = -log 2 (l/M) = 
N b bits per symbol. The rate at which this waveform can convey information 
is, therefore, N b /T b bits/s. 

Another common problem involves the use of an M-ary waveform to 
represent a sequence of binary digits, one digit per symbol interval of 
duration T b . In this case 

M = 2 Nb (3.6-14) 

pulse amplitudes are associated with N b symbol intervals. For equally 
probable independent binary digits (symbols) in the sequence, the information 



t M is assumed to be even here, as would occur in the important case where M is a 
power of 2. 
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conveyed is N b bits per M- ary symbol. However, an M-ary symbol duration 
is N b T b , so the information rate is l/T b bits/s, the same as in the binary 
sequence. Although information rate is no different than if the sequence 
had been represented by a polar binary waveform, bandwidth is different. 
Because bandwidth is proportional to the reciprocal of symbol waveform 
durations (T b in binary format, N b T b in this type of M- ary signal), the binary 
system requires larger bandwidth by a factor N„. Reduced channel bandwidth 
requirements is one of the major advantages of M-ary formatting. A major 
disadvantage is its greater sensitivity to noise. An example of M-ary formatting 
is shown in Fig. 3.6-7 for M = 8. 

3.7 SPECTRAL CHARACTERISTICS OF DIGITAL FORMATS 

In the preceding section we found that several of the digital waveforms 
could be generated by the networks of Figures 3.6-2, 3.6-5, and 3.6-6. In 
each case a network is excited by the impulse train 

a(f) = £ a k 8(t-kT b ) (3-7-1) 

*= -oo 

where the coefficients a, are related to the message’s binary digits. These 
relationships are summarized in Table 3.7-1. All the networks have the 
form shown in Fig. 3.7-1, where H(o>) is determined by the waveform format 

of interest. 

Power Spectrum 

If Sf{a)) represents the power spectrum of sit) in Fig. 3.7-1, then it 
can be shown that 

Sf(w) = ^»|//(co)| 2 (3-7-2) 

TABLE 3.7-1. Coefficients a t for Various Digital 
Formats. 


unipolar 

polar 

bipolar 

Manchester 

Duobinary 

I Modified Duobinary 


a k for Binary 

Digit 

0 

1 

0 

A 

-A 

A 

0 

±A 

-A 

A 

-A 

A 

2 

2 

-A 

A 

2 

2 


Remarks 


Signs alternate 
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) ► H(u) — *- 


r(r). Waveform 
of Interest 


Figure 3.7-1. Network model for 
study of spectral properties of digital 
formats. 


where S^(co) is the power spectrum of ait) and the network is assumed to 
have a real impulse response. 

Once y a ((o) is found, the spectral properties of a given format are 
obtained from 5^(a>) when //(to) is that which applies to the format. Although 
somewhat involved, it can be shown (Prob. 3-58) that a(t) has the time- 
averaged autocorrelation function 


and power spectrum 


where 


R a (r) = ^ £ RA r - rT b ) 

■* b - oo 


W=r 2 R r* 

b r* — oo 


1 

Rr = lim — 2 

K — “ k= -K 


(3.7-3) 


(3.7-4) 


(3.7-5) 


and the overbar represents the statistical average. 

For the special case where the a k are independent from interval to 
interval (over k) and have the same probability distribution in each interval, 
(3.7-5) reduces to 

fc4, r = 0 

R r = _ , (3.7-6) 

l («*) > r ^ 0. 

We next apply (3.7-2) and (3.7-4) to several specific formats. All are 
assumed to be NRZ type generated from independent binary digits. In 
some cases (3.7-6) applies. In others we must revert to use of (3.7-5). 


Unipolar Format 

Here Is and Os are assumed to occur with equal probability. Since 
a k is a discrete random variable having only the two values 0 and A (Table 
3.7-1) we have 


•M-t- 


+ A {\ 


(3.7-7) 


^ = W b + wA 


2 S (a) - ra> b ), (3.7-8) 
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where (3.7-4) and results of Prob. A-22 have been used and we define 

27 r * 


Finally, |//(o>)| 2 is given by |f , (w)| 2 using (3.6-3), sot 


^SM+^Sa 2 ^ 


5^®) = — 8M + “7 


(3.7-9) 


(3.7-10) 


The continuous part of this function is plotted in Fig. 3.7-2. Bandwidth to 
the first spectral null is <o b . The term (AV/2)8(e>) represents a dc component 
having half the total power. 


^ ^ Polar 


\ . Unipolar 


_ 10 _ \\ \ .Duobinary 


/ \ 


M r\~ '' f'\ 

m L 

f \mf 

n w ml 


Figure 3.7-2. Normalized power spectrums of unipolar, polar, and duobinary 
waveforms. 


t Subscript u is used to imply unipolar format. 
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For an RZ format with pulse duration T b /2, the power spectrum SP K (a) 
has a continuous part with twice the bandwidth of (3.7-10) and has spectral 
lines at odd multiples of ± <u b (see Prob. 3-59). These lines are useful in 
deriving clock synchronization. 

Polar Format 

Here a k has equally probable values -A and A for equally probable 
digits 0 and 1. Thus R 0 = (-A) 2 ® + (A) 2 ® = A 2 , R r = [(-A)® + 
(A)®] 2 = 0 for r + 0, so V a (w) = A 2 /T b . Again \H(a>f = |/>(o>)| 2 of 
(3.6-3) and t 

y p (a>) = A 2 T b Sa 2 (^). (3.7-11) 

This spectrum has the same shape and first-null bandwidth as the continuous 
part of (3.7-10) for the unipolar format. The power spectrum 5^(0 >)/A 2 T b 
is plotted in Fig. 3.7-2. 

Bipolar Format 

Even if message sequence binary digits are independent, the amplitude 
coefficients a k are not because of the constraint that the pulse polarities 
assigned to Is alternate. Thus more care must be given to the evaluation 
of a >flt k+r . From table 3.7-1 a k has values 0, A, and -A^ these occur with 
respective probabilities i, i, and ). Thus when r — 0, a\ = 0 2 ® + A 2 ® 
+ (—A) 2 ® = A 2 / 2 and R 0 = A 2 / 2 from (3.7-5). 

When r = 1 the message sequence (a t , a* +1 ) can have only possible 
values ( 0 , 0 ), ( 0 , 1), (1, 0 ), or (1, 1). The corresponding values of a k a k+ , 
are 0, 0, 0, and -A 2 with probabilities of i each. Hence a k a k+l = 
3(0)® + (- A 2 )® = — A 2 /4, so R t = -A 2 /4 from (3.7-5). 

When r > 1, half of the possible message sequences (a t , a* +1 , . . . , 
a k+r ) have a k = 0 and lead to a*a* +r = 0. Another one-fourth of the 
possible sequences have a k = 1 but have a k+r = 0, so again a k a k+r = 0. 
The remaining one-fourth of the possible sequences have both a k = 1 and 
a k+r = 1, so a k a k+r = A 2 or -A 2 , depending on how many Is or 0s fall 
between digits k and k + r in the message sequence. There are r - 1 
digit positions between a k and a k+r so there are 2 r ~‘ possible binary “words” 
in the r - 1 digits. Examination shows that half of these words have an 
odd number of Is and half have an even number. An even number results 
in a*a t+r = -A 2 , whereas an odd number gives a k a k+r = +A 2 , each half 
the time. These considerations collectively mean that a k a k+r = 0 for 
r > 1. 


t Subscript p denotes the polar format. 
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From (3.7-4) we now have 


q> ( \ _ d. _ i + i _ I e -> T ‘ 
a( ) T^L 4 2 4 j 


(3.7-12) 




so the bipolar waveform power spectrum, denoted by Sf bi (a>), is 


y bi (u>) = A 2 r i) sin 2 l 1 Sa' 


(3.7-13) 


from (3.7-2) and the fact that \H(wf = |P(oj)| 2 with (3.6-3) applying. A 
plot ofy bi (o))/A 2 T h is shown in Fig. 3.7-3. Although the first-null bandwidth 



Figure 3.7-3. Normalized power spectrums of bipolar, Manchester, and modified 
duobinary waveforms. 
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is o> b , the same as unipolar and polar formats, there is only small spectral 
content for frequencies near dc. 

Manchester Format 

For independent and equally probable message digits, the values A 
and -A of a* occur independently with equal probability. We readily find 
that R 0 = A 2 and R r = 0 for r > 0. Thus 5^(<u) = A 2 /T b and the Manchester 
waveform’s power spectrum, denoted by Sf man (<n), becomes 

U«) = A 2 w(^)sa 2 (^). (3.7-14) 

In obtaining (3.7-14) we have used (3.6-8) and (3.7-2) with |//(co)| 2 = |/ > (o>)| 2 . 
The first-null bandwidth of is 2o>„, twice that required by bipolar 

signaling. Figure 3.7-3 shows a plot of 9‘ man {u>)/A 1 T b . 

Duobinary Format 

Even though duobinary pulses are correlated in adjacent intervals for 
independent message digits, this correlation is accounted for in the model 
of Fig. 3.6-5. Thus a k and ct k+r are independent for independent message 
digits. Since the values of ot k of A/2 and -A/2 are equally probable for 
equally probable message digits, we readily find that R 0 = A 2 / 4, R r = 0 
for r > 0, so ^„(o>) = A 2 / 4 7/ and the power spectrum, denoted by ^(cu), 
becomes 

*,(«) = A 2 7/ cos 2 (^)sa 2 (^) , (3.7-15) 

where (3.6-12) and the network transfer function of Fig. 3.6-5(b) have been 
used in (3.7-2). 

A plot of y d (u))/A 2 T b is shown in Fig. 3.7-2. First-null bandwidth is 

oj b /2. 

Modified Duobinary Format 

Developments for this waveform parallel those for the duobinary format 
except the waveform model is that of Fig. 3.6-6. The modified duobinary 
waveform’s power spectrum, denoted by Sf md {o)), is now 

y m A<») = A 2 T b sin 2 {o)T b ) Sa 2 (^j . (3.7-16) 

&’ md {u>)/A 1 T b is plotted in Fig. 3.7-3 which shows a first-null bandwidth of 
t» b /2 and small values for frequencies near dc. 



120 


Baseband Digital Waveforms Chap. 3 


Miller Format 

The power spectrum of this format is more difficult to determine than 
those described earlier. It is given by Lindsey and Simon [39, p. 21] and 
shows a sharply peaked response near co = ±0.380* with low-level values 
(about 12 dB below peak response) out to about o = ±1.70*. Other 
“sidelobe” responses above o = ±2o* fall off at the same relative rate 
(as l/o 2 ) as the other formats discussed earlier. 

3.8 TIME MULTIPLEXING OF BINARY DIGITAL 

WAVEFORMS 

Time division multiplexing was briefly introduced in Sec. 2.8. It was shown 
how samples from a number of messages could be interlaced in time for 
combined transmission over a single channel. It was left as more or less 
obvious to the reader that the same techniques could be applied to other 
systems. In this section we extend these earlier concepts to show how 
many messages may be time multiplexed using the pulse formatted waveforms 
described in the foregoing sections of this chapter. The multiplexer (MUX) 
involved is a digital multiplexer. 

Generally, a digital multiplexer can be considered as a parallel-to- 
serial converter. It accepts a set of inputs (or messages, often called trib- 
utaries) applied in parallel and interlaces the inputs into a single output 
signal having specific time intervals allocated serially to each message. In 
a word-interleaved multiplexer each of these time intervals contains a complete 
word for a given message. The serial output would then comprise a word 
due to message 1, a word for message 2, etc. A bit-interleaved multiplexer 
places a single input bit in each output interval; the output here would 
consist first of a sequence of the first digits from each of the input words, 
then a sequence of the second digits from all inputs, and so on. As in 
Chap. 2 we use the term frame to refer to the smallest time interval in the 
output signal that contains at least one sample (word) from all input messages. 
A frame can also include some time allocated to other functions, such as 
synchronization and signaling (dialing and ringing, for example, in telephone 
systems). 

Timing of all operations within a multiplexer is controlled by a highly 
stable oscillator called the master clock. If the digital structure of all the 
input messages is determined from the same master clock, the multiplexer 
is called synchronous. If different clocks are involved, the multiplexer is 
said to be asynchronous . Of the two, synchronous multiplexers are more 

straightforward. , 

In practice, digital multiplexers can be divided into two groups [36J. 
In the first group are multiplexers designed to combine inputs from a number 
of low-speed data terminals. Output bit rates of these multiplexers are 
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typically 1200, 2400, 3600, 4800, 7200, or 9600 bits/s when used with modems 
(modulation-demodulation system) designed to interface with commercial 
voice-grade telephone lines. Lower rates apply to the usual dial-up grade 
line, and the higher rates apply to specially conditioned privately leased 
lines. 

The second group of multiplexers typically use high bit rates and are 
of the types used by common carriers such as American Telephone and 
Telegraph (AT&T) in the United States, Nippon Telegraph and Telephone 
(NTT) in Japan, and others all over the world. The hierarchies of these 
systems are subsequently outlined but involve both synchronous and asyn- 
chronous multiplexing. The recent trend of AT&T has been toward a 
nationwide synchronous system with master clock located in Hillsboro, 
Missouri [37]. 

AT&T D-Type Synchronous Multiplexers 

The basic first-level multiplexer in the AT&T system is a synchronous 
device. Three models are in use. The oldest, called a D1 channel bank, 
was introduced in the early 1960s [45]; it accepts 24 analog voice frequency 
(VF) messages at its input, samples each at an 8-kHz rate, compresses 
samples according to a p — 100 law, and encodes each sample using 8-bit 
words (7 bits for amplitude, 1 bit for signaling). One additional bit is added 
per frame for synchronization so 1 + 24(8) = 193 bits are present per 
frame of duration 1/8(10 3 ) = 125 /as. Output bit rate becomes 193 bits/125 
/as = 1.544 megabits/s. The D1 bank uses word interleaving. 

In 1970 the D2 channel bank was introduced [46, 47]; it provided 
improved performance compared to the D1 bank. The D2 bank accepted 
96 analog voice input messages and multiplexed these in groups of 24 onto 
four 1.544 megabits/s output lines. Thus each output line carries 24 VF 
messages, as in the D1 unit. Multiplex formatting used in the output lines 
was slightly different than the D1 unit but was the same as that used in a 
newer channel bank, the D3, introduced in the mid-1970s as a replacement 
for the D1 bank. Because of similarities, we describe only the D3 bank. 

The D3 channel bank [47] multiplexes 24 analog VF input messages 
into a 1.544 megabits/s output (called a T1 line) that can be combined by 
additional multiplexing with other similar lines, if desired, or transmitted 
directly. As in the D1 and D2 systems, message samples are at an 8-kHz 
rate. Samples in the D3 bank are compressed using a p = 255 law and 
coded using 8 bits per sample. One additional bit is added per frame for 
synchronization for a total of 193 bits per frame. A frame is 125 /as in 
duration. For 5 frames out of every 6, each message is conveyed by full 
8-bit words using a bipolar waveform format. However, in every sixth 
frame, the eighth digit is stolen for signaling purposes. In some systems 
two signaling paths are required, so a 12-frame sequence called a multiframe 
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[48] (or superframe [46]) spans both message and signaling operations. For 
frame synchronization the framing bit in odd-numbered frames has the 
digital sequence 101010; thus frames 1, 5, and 9 carry a 1, whereas frames 
3, 7, and 11 carry a 0 in each multiframe. To align the multiframe the 
even - numbered framing bits have the sequence 001110. This sequence allows 
frames 6 and 12 to be identified as following transitions 01 and 10, respectively. 
Figure 3.8-1 illustrates the format of a typical frame in the D3 channel bank 
that uses word interleaving. 



, 2 3 4 5 6 7 8 


I Signaling 
Bit 


Figure 3.8-1. Format of typical frame for D3 channel bank. 

Hierarchies of Digital Multiplexing 

Several levels of multiplexing may be combined to form a hierarchy 
of multiplexers. Three principal hierarchies exist in the world today. In 
the United States and Canada the system of Fig. 3.8-2 is used. It is known 
as the T-carrier system of AT&T. At the lowest level the D3 channel bank 
is commonly used to generate the multiplexed signal that becomes one of 
four possible inputs (over four T1 lines) to the M12 second level multiplexer. 
The M12 multiplexer output has a 6.312 megabits/s bit rate and drives a 
T2 line. Higher multiplexing levels (up to the T5 line) are possible as shown. 
All the higher multiplexers (M12, M23, and so on) are asynchronous, and 
the M12 device is discussed in the next subsection as an example of such 

multiplexers. „ , , , 

Variations exist in the implementations of the hierarchy ot Fig. 
3.8-2. The M23 multiplexer can be bypassed using an M13 device having 
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28 Tl-line inputst [49, 50]. Furthermore, the various T-lines do not always 
have to have analog voice signals as their original source messages. A data 
bank can be used to multiplex digital sources to generate a 1.544 megabits/s 
output with the correct format to pass over a T1 line. A coder can be used 
to digitize visual telephone signals for T2 line inputs to M23 multiplexers 
[51]. A codec (a system for coding and decoding) can be used to make an 
analog mastergroup signal (a waveform resulting from frequency multiplexing 
600 telephone signals) compatible as an input to an M34 multiplexer [45, 
51]. Finally, using current-day technology, digital color television signals 
can be generated with a small enough bandwidth to be compatible with a 
T3 line [49]. i 

The digital hierarchy used in Japan is much the same as that in Fig. 

3.8- 2 at the lower levels. It differs in the higher levels as shown in Fig. 

3.8- 3. 

Except in North America and Japan, most of the rest of the world 
uses the hierarchy suggested by the Consultative Committee on International 
Telegraphy and Telephony (CCITT), as shown in Fig. 3.8-4. As with the 
AT&T hierarchy, level 3 can be bypassed; in this case an M24 multiplexer 
generates a level-4 waveform by multiplexing 16 level-2 signals [50]. 

Asynchronous Multiplexing 

In asynchronous multiplexing, the input digital bit streams originate 
from sources with different clock rates. Even if the sources have clocks 
with the same average rates, these rates can change with time due to natural 
instabilities in the oscillators that generate them. Furthermore, even if all 
the source clocks are perfectly synchronized, variations in delay of the 
channels connecting the sources to the multiplexer input can cause variations 
in the bit rates of the arriving digital signals. A commonly used method 
for overcoming these problems is to design the multiplexer output bit rate 
to be at least as large as the sum of the input rates at their largest possible 
values. With this choice the rate of the input bits (or words) is collectively 
smaller than the output bit (or word) rate. Times will occur when input 
data are not available at output clock instants. Such voids are filled by bit 
(or word ) stuffing, the adding of dummy bits or words. Control bits in the 
output data format are used to identify where stuff bits or words are located 
so the receiver demultiplexer can remove them. 

Because the concepts involved in either word or bit stuffing are similar, 
we discuss only bit stuffing by means of an example system using bit 
interleaving. Consider a multiplexer such as the AT&T M12 device of Fig. 

3.8- 2, where all of the input lines (tributaries) have the same average rate, 

t The digital signal on a line Ti, / = 1,2 5 is sometimes called a DSi signal; 

thus a DS3 signal is used on a T3 line. 

t For details on the M13 and M34 multiplexers, see [52]. 
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Figure 3.8-4. Digital hierarchy suggested by CCITT applicable to most of the world 
(United States, Canada, and Japan excluded). 
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denoted by R t . For an output total bit rate R a and a multiframe time 
T mf , the total number of bits in a multiframe will be 

RJ m/ = N(M d + M c ), (3.8-1) 

where N is the number of tributaries, M d and M c are the numbers of data 
and control bits, respectively, per tributary per multiframe. Now suppose 
that the average number of stuff bits added to each tributary per multi- 
frame is s. Then the total number of bits generated from tributaries is 
N[R,T* + j], which must equal the number of output data bits NM d 
(because all source bits, even stuff bits, are data to the output line). Hence 

R i T mf +s = M d . (3.8-2) 

By solving (3.8-1) for T mf and substituting into (3.8-2) we have 


, ,J, N Ri(M d + M c ) 

1 T — \- 

Alternatively, 

NR,(M d + M c ) ( M d \ 


(3.8-3) 

(3.8-4) 


By proper choice of parameters, s can be small, whereas a relatively large 
variation in input bit rate is allowed. 


Example 3.8-1 

In a given multiplexer suppose R t = 1.544 megabits/s, R„ = 6.312 megabits/s, 
M d = 288 data bits per tributary per multiframe, M c - 6 control bits per tributary 
per multiframe, and N = 4 tributaries. From (3.8-3) 



1 - 


4(1,544)294 ' 

6.312(288) 


0.3346 


stuff bits occur per tributary per multiframe, on the average. 


The values of the above example apply to the AT&T M12 multiplexer, 
which is designed to allow only one stuff bit, maximum, per tributary per 
multiframe. By retracing the procedures leading to (3.8-4), we find that 
the equation applies to instantaneous rates if all tributaries are assumed to 
have the same rate. That is, we can remove the averaging bars and write 


Ro (M d - s) 
N(M d + MX 


(3.8-5) 


where R, and s are momentary values. In the M12 multiplexer Osjs 1, 
so 1.5404 megabits/s =s /?, s 1.5458 megabits/s for a tolerance of 1.544 
megabits/s ( + 0.116%, -0.231%). This allowable range of input bit rates 
is larger than that expected from current-day stable clocks (easily on the 
order of ± 50 parts per million or better) but is needed to account for timing 
jitter and reduce reframe time of the multiplexer [22, p. 613]. 
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The detailed structure of the multiframe format of the M 12 multiplexer 
is shown in Fig. 3.8-5 [22]. Each row will be called a frame and all four 
rows, transmitted in sequence, are called a multiframe. Positions marked 
M F or C, with subscripts, represent single control bits where the subscripts 
on' M and F denote the bits’ digital values (0 or 1). The F-sequence is an 
alternating 0,1-sequence that allows frame identification for synchronization 
purposes. After frame lock-on, the receiver’s demultiplexer searches the 
M-sequence to identify its pattern (which is 0111) for multiframe synchro- 
nization. After frame and multiframe lock, the demultiplexer identifies data 
bits that occur in blocks of 48 bits between control bits in Fig. 3.8-5. The 
C-sequence, which is discussed later, identifies the location of stuff bits. 


M 0 

(48) 

c, 

(48) 

F 0 

(48) 

c. 

(48) 

c, 

(48) 

F, 

(48) 

M, 

(48) 

C n 

(48) 

F 0 

(48) 

c„ 

(48) 

Qi 

(48) 

F, 

(48) 

M, 

(48) 

Cm 

(48) 

F 0 

(48) 

Cm 

(48) 

Cm 

(48) 

F, 

(48) 

M, 

(48) 

Civ 

(48) 

F 0 

(48) 

Civ 

(48) 

Civ 

(48) 

F, 

(48) 


Figure 3.8-5. Multiframe format of AT&T M12 multiplexer. 


Each 48-bit data block involves interleaving one digit from each of 
the four inputs and then repeating the pattern until 12 digits are taken from 
each of the four inputs. Due to design, only one stuff bit per frame can 
occur; if a bit has been stuffed into input i (i = 1, 2, 3, or 4), it is placed 
in the first data bit’s position (of the 12 for input i) that follows F, in frame 
i. To identify where stuff bits occur, the C-sequence in each frame is used. 
A C-sequence 111 corresponds to a stuff bit, whereas a normal sequence 
000 indicates no stuff bit. Thus control bits C m would have the sequence 
111 in frame III if a stuff bit has been inserted; otherwise the C„ r 
sequence is 000. For high reliability, majority logic is used in detecting the 
C-sequences (two Is of three is a stuff bit, two 0s of three is no stuff bit) 
because an error in a stuff-bit decision will result in an error m all bits of 
the entire frame of the T1 system in question [53, p. 163]. 

Time Division Multiple Access 

Time division multiple access (TDMA)t is a time-multiplexing method 
important in communication satellite systems. In this system various earth 
stations may transmit information to a shared satellite during an assigned 
time interval. All stations transmit on the same frequency (about 6 GHz) 
and receive down-link transmissions from the satellite on the same frequency 

t Although this chapter is mainly on baseband systems whereas TDMA is a earner 
(bandpass) system, we include it here because it illustrates the time-mulUplexmg pnnciple. 
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(about 4 GHz).t The system is synchronous so that when a station is 
transmitting, it is the only one and it has access to the satellite’s full 
bandwidth. 

To utilize a TDMA system a station must store its input (user) data 
for the period of time between its transmissions, called bursts, and then 
read the data out for transmission in a burst at a very high bit rate. Because 
stations occupy different distances from the satellite, which itself can change 
position with time, careful control over burst timing is essential. These 
characteristics require stations to have relatively large memory buffers and 
generally be under computer control. 

A typical format for data passing through the TDMA satellite is shown 
in Fig. 3.8-6. A frame consists of arriving bursts from earth stations being 
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served. The bursts do not have to have the same durations and can be 
longer for heavy-traffic stations. Guard intervals G u G 2 , . . . , G N+] are 
present to absorb small timing errors in burst arrivals so that overlap is 
prevented. An interval S is allocated for frame and system synchronization, 
this transmission originates from a single station called the reference station 
[31, p. 238], Bursts consist of a preamble, a data portion; and sometimes 
a postamble is present to initialize decoders for the following burst. 

The preamble consists of portions used for carrier recovery (CR) in 
coherent systems, bit timing recovery (BTR), burst synchronization using 
a unique word (UW), station identification coding (SIC), and other mis- 
cellaneous items, such as command and control, signaling, and error mon- 
itoring. The data or message segment is subdivided into time slots allocated 
to the stations for which the data are intended, as shown in Fig. 3.8-6. 

After amplification and frequency shifting, the satellite retransmits its 
signal back to the earth stations, which must lock to the bursts received 
and sort out the data streams unique to their locations. 

Other TDM Methods 

In the TDMA method just described, the earth stations were required 
to transmit only during assigned burst time slots. Another TDM method, 
called ALOHA [55], allows any network station to transmit bursts, called 
packets, at any time. Since the packets are unsynchronized, they may 
overlap at the satellite, creating what is called a collision. Collisions necessitate 
that the stations retransmit their packets after a random delay period. 
Repeated collisions require repeated delays and retransmissions. ALOHA 
makes use of the fact that most stations have significant idle time but it 
fails when heavy traffic loading of the satellite occurs. 

In a modification of the ALOHA system, called slotted ALOHA, 
stations may transmit only during prescribed time slots, but may use any 
of the allowed slots randomly. In principle, this method eliminates partial 
collisions but does not prevent full collisions from happening in any given 
time slot. 


3.9 SUMMARY AND DISCUSSION 

This chapter is mainly concerned with describing the functions shown in 
the digicom system block diagram of Fig. 1.3-1 for the special case of 
baseband transmitted waveforms. The effect of noise on the channel is not 
considered because this topic is reserved for detailed discussion in the next 
chapter. Our main efforts concentrate on the ways in which digital waveforms 
are generated from analog or digital message sources (transmitter problem). 
Since the functions of the receiver are essentially the inverses of those in 
the transmitter, these functions receive less attention. 
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Initially it is shown that an analog message can be sampled and quantized 
to obtain a digital representation of the analog source. It is found that all 
quantizers limit the signal-to-noise ratio that can be achieved when message 
reconstruction is performed. The largest signal-to-noise ratio achievable, 
denoted by ( S 0 /N q ), is called the signal-to-quantization noise power ratio. 
For the very important uniform quantizer, that has a uniform separation 
8v b etween adjacent quantum levels, and a message fit) with average power 
f\t), it is shown that 



when the analog message is bounded or when the extreme quantum levels 
are set so that quantizer overload is negligible. By setting the extreme 
quantum levels properly (Fig. 3.2-4), the uniform quantizer can be optimized 
to give the performance shown in Fig. 3.2-5. The curves of Fig. 3.2-5 show 
clearly that the optimum uniform quantizer is sensitive to shape mismatch, 
which occurs when the probability density of the message differs in form 
(shape) from that for which the quantizer is designed. 

By generalizing to allow nonuniformly separated quantum levels, a 
quantizer can be optimized. A procedure is outlined to determine the 
optimum quantum levels. For the optimum nonuniform quantizer, (SJN q ) 
is given in Fig. 3.2-2. For a large number (L) of quantum levels, (3.2-19) 
can be used to obtain ( S a /N q ) quite accurately. 

The optimum nonuniform quantizer is equivalent to a uniform quantizer 
operating on a signal that has passed through a properly chosen nonlinear 
network called a compressor. The companded quantizer of Sec. 3.3, which 
is defined by the nonlinear function v{f) of (3. 3-7), is_ optimurnjbr large 
L. Its performance is determined by (SJN q ) = f\t)/s\ , where z q is given 
by (3.3-6). Often used nonoptimum companded quantizers, the p- law and 
A-law devices, are also briefly described. 

The next major segment of this chapter concerns the conversion (source 
encoding) of the message levels into a suitable digital representation, usually 
a sequence of binary digits 0 and 1. Various encoding methods are described, 
including the optimum, or Huffman, procedure that minimized the average 
number of binary digits needed to represent a source. 

In some systems additional coding, called channel encoding, is used 
to improve the system’s performance with channel noise present. Both 
block codes and convolutional codes for this purpose are discussed. By 
use of these codes, the receiver can correct certain errors it makes in 
initially determining which binary digits are transmitted. 

Next, a number of important waveforms (formats) are discussed that 
can be assigned to represent the channel-encoded binary digits. The spectrums 
of these waveforms are also determined. There is no optimum waveform 
choice for all systems, but some comparisons can be made [56]. In cases 
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where bandwidth conservation is not too important, retum-to-zero (RZ) 
r ype formats may be attractive. The polar-RZ format, for example, always 
has two level transitions per symbol interval, which allows extraction of 
svmbol timing. In applications where bandwidth conservation is important 
the NRZ-type formats are best with duobinary and modified duobmary being 
the better of the ones discussed. Although symbol synchronization can be 
derived from these waveforms, problems can arise if long strings of either 

° S ° f Naturally , bandwidth and synchronization characteristics of a format 
are not the only important properties of a waveform format. Its performan 
in noise is also important. We consider this subject in thefol lowing chapter 
Finally, this chapter closes on a more practical level by discussing 
ways of time multiplexing digital waveforms. Examples of synchronous 
and asynchronous multiplexers are given. 
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PROBLEMS 


3-1. A random analog message has values -8 V s fit) s 8 V. If samples of 
fit) are to be quantized using an 8-bit binary code in a uniform quantizer, 
what voltage levels form the boundaries of the quantizer’s intervals? What 
is a reasonable set of quantum levels if no further knowledge is available 
about /(f)? 

3-2. A message /(f) has a probability density 



-V«/« V 
elsewhere. 


It is applied to a uniform quantizer with eight quantum levels ±1, ±3, ±5, 
and ±7 V centered in uniform 2-V intervals, (a) What maximum value can 


V have without amplitude overloading the quantizer? (b) What is the power 


in /(f) when at its reference level found in (a)? 


★3-3. A message and quantizer are defined as in Prob. 3-2 except V can have any 
value._(a) Use (3.1-11) to find an expression for mean-squared quantization 
error e? valid for any value of V. (b)_Find the power in /(f) for any value 
of V. (c) Sketch the ratio f 1 /e\ versus/ 2 //? where/? is the signal’s reference 


power level. 
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*3-4. Work Prob. 3-3 except assume a message defined by 


_J _ -V*f*V 

PAD = 2V(1 - e ') 

0 ( elsewhere. 

3-5. If So is small and L is large show that the middle terms of (3.2-3) will reduce 
to (8 u) 2 /12. 

3-6. A message /(f) has a probability density 


PAD = 


|/| < 4 V 
elsewhere 


„l,«n .t it. referee levd tor , uniform 

SL,S F (c”ho;Z.V .«* « r«,n«d .0 provide , — > 
vLe of (SJN,) = 131,072.0 with no overload? 

★3-7 A random message with probability density 

! [ V2|/l\ 

w(/) = # e Tv?) 

is Quantized in a quantizer having a large number (L) of levels between 
extremes -V, V. Assume L is so large that extreme quantum levels are 
I = -V and ~ V. (a) Solve (3.2-6) for the signal-to-quantization noise 
power ratio, (b) Plot (SjN q ) in decibels versus normalized message power 
W 2 in decibels for L = 16 and 128. (c) Compare your result w.th Fig. 

3.2-1 for a uniformly distributed message by assuming |/(f)Lx m the figure 
(and Example 3.2-1) equals V. 

3-8. Use Leibniz’s rule to derive (3.2-10) and (3.2-11) from (3.2- ). 

3-9. Show that (3.2-12) derives from (3.2-11). 

3-10. A quantizer has 32 nonuniform quantum levels that are optimally to work 
with a Gaussian random message at (reference) power level / (f) - 07 - 
fal At what voltages should the extreme quantum levels /, and l n be set. 
(?) Whit value OH W can be expected when message recovery occurs 

in the receiver? 

3-11. Work Prob. 3-10 for a message with a Laplace probability ensi y. 

3.12 Work Prob. 3-10 for a message with a gamma probability density. 

-* »•*" a Quantizer must be designed to operate with a Gaussian message and have 
3 ' 15 ' separated levels. » “f 

must have a signal-to-quantization power ratio of 20.3 dB, . how many quamum 
levels are required? (b) What loading factor shouW *e quantoer^^ 
(c) If the extreme quantum levels are to be optimally 
should the signal’s reference power be? 
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3-14. Work Prob. 3-13 for a message with a Laplace probability density and a 
signal-to-quantization power ratio of 20.6 dB. 

3-15. Work Prob. 3-13 for a message with a gamma probability density and a 
signal-to-quantization power ratio of 17.5 dB. 

3-16. For a Gaussian message for which v = 2 in (3.2-22) show that (3.2-19) will 
reduce to 

S„\ L 2 



3-17. Assume L = 32 and use the result of Prob. 3-16 to compute the approximate 
value of ( SJN ,). Compare your result to the value taken from Fig. 3.2-3. 

3-18. The optimum nonuniform quantizer reduces to (becomes) a uniform quantizer 
when the message has a uniform distribution. Assume such a message and 
show that (3.2-19) does give the performance of the uniform quantizer (which 
is given in Example 3.2-1 for R = 1). That is, show that 



3-19. Use (3.2-14) to prove that the optimum quantizer for a uniformly distributed 
message is uniform. 

3-20. Sketch the optimum compressor characteristic v(f)/V, as given in Example 

3.3- l.versus f/V for values of V/V/j = 5, 10, and 20. Which value of 
V/Vfl corresponds to the greatest message compression? 

3-21. For a signal having a Laplace probability density show that (3.3-16) is true 
if V is large enough relative to / 2 so that p { (f) is negligible for |/| > V. 
★3-22. A message /(/), to be quantized after using a /x-law compressor, has a uniform 
probability density p/(f) on -V f < f < V/ where V s =s F of the compressor, 
(a) Find / 2 and |/| for this message, (b) Use (3.3-15) to compute and plot 
( SJN q ) for n = 255 with L = 16 and 128. Compare the results with Fig. 

3.3- 4 for the message having a Laplace probability density. 

3-23. Sketch the A-law compressor characteristic v(f)/V of (3.3-17) versus 0 =s 
f/V 1 for A = 10, 100, 1000. Compare your curves with those of Fig. 

3.3- 3 for /x-law. 

3-24. (a) Assume a message has the Laplace density of Example 3.3-1 and is to 
be quantized by the A-law characteristic of (3.3-17). If f 2 /V 2 is small enough 
(say 0.01 or smaller) so that pjf) is negligible for |/| > V, use (3.3-5) with 
/, = -V and_/ t+I = V to find (SJN,) = (S./e 2 ), (b) Plot ( SJNJ/L 2 
in dB versus / 2 /V 2 in dB for f 2 /V 2 =s 0.01 when A = 100. Compare your 
results with the /a = 255 curve of Fig. 3.3-4. 

3-25. An eight-digit natural binary code is used to represent the integers 0,1, . . ., 
255 where 00000000 represents the number 0. Write the binary codewords 
for integers (a) 14, (b) 27, (c) 145, (d) 201, and (e) 237. 

3-26. (a) What is the minimum number of code digits required to work Prob. 3-25 
if a ternary code is used? Find the codewords for the integers (b) 19, 
(c) 35, (d) 166, and (e) 199. 
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3-27. Work Prob. 3-26 for a quaternary code. 

3-28. A binary code of length 8 is used to represent the levels of a discrete message 
source. How many levels can the source generate for a unique representation? 
3-29. Assume a digital message generates the decimal levels of Table 3.4-1 with 
the following probabilities: 


Levels numbered 

0, 15 

1, 14 

2, 13 

3, 12 

4, 11 

5, 10 

6, 9 

7, 8 

Probability 

0.001 

0.002 

0.007 

0.020 

0.05 

0.08 

0.14 

1 0.2 


Find the average number of binary Is transmitted per codeword when coding 
uses (a) a natural binary code, (b) a folded binary code, (c) an inverted 
folded binary code, and (d) a Gray code. 

3-30. Write the natural binary codewords that correspond to the set of integers 

O, 1, 2, . . . , 31. (a) How many digits are required in a 4-ary code to 
represent these same integers? (b) Write an orderly (increasing numbers) 
set of 4-ary codewords corresponding to the integers. Note that you have 
performed a simple binary-to-M-ary code conversion with M = 4. 

3-31. A digital source can generate any one of four symbols k , Z 3 , and U that 
occur with probabilities P\ = 1/8, P 2 = 1/8, Pi — 1/4, and P t - 1/2, 
respectively, (a) Find the information contained in each source symbol, (b) 
Find the entropy of this source. 

3-32. For the source of Prob. 3-31 what is the code efficiency if source symbols 
are coded natural binary? 

3-33. If binary codewords 0, 10, 110, and 111, respectively, represent source 
symbols Z 4 , Z 3 , Z 2 , and /„ in Prob. 3-31, what is the code’s efficiency? 

3-34. A digitized source can generate any one of eight symbols at any one time. 
The symbols Z„ Z 2 , . . . , l, occur with probabilities P, = 0.01, P 2 = 0.06, 

P, = 0.12, P, = 0.30, P 5 = 0.22, P 6 = 0.16, P 7 = 0.11, and P s = 0.02, 
respectively, (a) Find the Huffman code for this source, (b) Find the code’s 
efficiency, (c) What would the efficiency be if a Gray code had been used 
instead? 

3-35. A digital source can generate eight symbols Z ( , Z 2 , . . - , Z» with respective 
probabilities Pi = 0.33, P 2 = 0.22, Pj = 0.16, P 4 = 0.1, P 5 - 0.09, P« - 
0.04, P 7 = 0.035, and P 8 = 0.025. Find the Af-ary Huffman code for M - 
3 (ternary coding). 

3-36. Symbols A and B of a binary source have probabilities P(A) = 0.7 and 
p(B) = 0.3. Symbols over time are statistically independent. Use Huffman 
source encoding and find the entropy and code efficiency of (a) the source, 
(b) the second-order extended source, (c) the third-order extended source, 
and (d) the fourth-order extended source. 

3-37. Work Prob. 3-36 except assume P(A) = 0.9 and P{B) = 0.1. 

3-38. A ternary source can generate three symbols. A, B and C with probabilities 
P(A) = 0.6, P(B) = 0.3, and P(C) = 0.1. Use Huffman binary source 
encoding and find the entropy, codewords, and code efficiency of (a) the 
source, and (b) the second-order extension of the source. 

3-39. An (n, k) block code with single-error correction capability is desired for a 


source. Consider values of k of 4 k « 12 and find the smallest number 
of check digits that can be used for each value. Find each code’s efficiency. 
Which of your codes can be perfect? 

3-40. Work Prob. 3-39, except assume a two-error correction code is desired. 
3-41. A nonsystematic (n, k ) code has the parity check matrix 


[»] 


1 1 1 000 
0 10 110 
100101 


(a) What can you do to put this matrix in systematic form? (b) What are 
n, k, and r? (c) How many errors, if any, can this code correct? (d) Use 
the systematic form of [ H ] and find the code’s generator matrix. 

3-42. The generator matrix for a (7, 4) code is 


[1000 

1 1 0 ] 

0 100 

0 1 1 

00 1 0 

1 1 1 

000 1 

101 


The input data words to the encoder are derived from a 4-bit Gray source 
encoder. Make a table of the input codewords and their corresponding 
channel codewords. 

3-43. (a) Find the parity check matrix for the encoder of Prob. 3-42. (b) At the 
receiver the codeword 1011001 is received over the channel. If any errors 
are present it is unlikely that more than one exists. With this assumption, 
is the received codeword in error? If so, in which digit? 

3-44. Find code efficiency for the first five nontrivial Hamming codes. How many 
trivial Hamming codes are there? Why would these be called trivial? 

3-45. Find both the parity check and generator matrices for a (15, 115 Hamming 
code. 


3-46. Find the Hamming distances between the codeword 0100101111 and each of 
the codewords 1010101010, 1110100000, 1111000000, and 0101010101. 

3-47. A (6, 3) code has a generator matrix 


[G] 


1001 101 
010;001 
oo 1 1 no 


(a) Find all possible channel codewords if the source code is 3-bit natural 
binary, (b) Find the smallest Hamming distance (called the minimum distance 
d m J of all channel codeword pairs. If d min is at least 3, this code can correct 
at least one error per codeword. 


3-48. A digit sequence 10100 is input to the convolutional encoder of Fig. 3.5-2. 
Show the path through the code tree and the output sequence generated for 
this input. 


3-49. The encoder of Fig. 3.5-2(a) is modified so that the left and right adder 
outputs are S, © S 2 and S 2 © S 3 , respectively, (a) Sketch the new code 
tree, (b) Indicate the tree path corresponding to the input sequence 100110 
and find the corresponding output sequence. 
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3-50. Draw the treUis path applicable to the encoder of Prob. 3-49. 

3 51 A receiver demodulator for the convolutional encoder of Fig. 3 ' 5 ^ determl "“ 
that 'the deceived channel sequence is 110100001000011100110010 where he 
ast four Se known to be tail digits. Use the Viterbi algorithm with the 
w trellis of Fig. 3.5-3 to find the original data word. Based on 
the recovered data word being correct, were any channel errors introduced 
into the received sequence? If so, which digits? 

3-52. Find the output sequence of a differential encoder to the input sequence 
1011100100011 when the output sequence is initially 0. 

3-53. Let p(t) of (3.6-11) for the duobinary waveform be replaced by 

_ ir(f — V2) l 

p(t) - Sa|^ j b ]• 

(a) Sketch duobinary waveforms using pit) and (3.6-11) for the data sequence 
01101110001001 (b) At what times should the two waveforms be sampled 
L a reLTer to produce the same sample values? (c) What advantage does 
the given waveform have compared to (3.6-11)? 

3-54 Precode the data sequence 001011101001 and construct a duobinary wavefoim 
Use waveform samples as indicated in the text (assuming no noise) and ^venfy 
that the decoding logic (±A decodes as 0, zero decodes as 1) reproduces 

the data sequence. , 

3 55 Work Prob. 3-54 for a modified duobinary waveform where the Prefer 
uses a 2-bit delay and the decoder produces a 0 for sample values of zero 

and 1 for samples ±A. , • _ t;ll 

*3 56 Reduce T„ in Fig. 3.6-5 by 27* and show that the output waveform is stdl 
duobinary Assume precoding of the input {M sequence as in Fig. 3.6 3, 
except Ldth delay 2T„. The proof can consist of showing that the sequence 
{b k } given by . . ■ 100101110110010111000100 ... is correctly recovered. 
Samples ±A decode as 0 and zero decodes as 1. 

3-57. An impulse 8(t) can be considered as a limit as follows: 

8 (,) = limire ct Q 

where rect(-) and tri(-) are defined by (A.2-1) and (A.2-2), respectively. Use 
these results to prove that 


8 (t - r,)S(r - h - t 0 ) dt = 8 (t 0 - fi + l i) 


where t„ t 2 , and t 0 are arbitrary real constants . 

. 3 - 58 , By using the integral of Prob. 3-57 show that a(t) given by (3.7-1) has 
time-averaged autocorrelation function of (3.7-3). 

equal probabilities. Compare your result with (3.7-10). 
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3-60. Find the power spectrum of a polar NRZ signal having pulses 

m - s*[|(, - f) 

instead of the rectangular pulses of (3.6-2). Plot your result in the form of 
Fig. 3.7-2 and compare to the rectangular-pulse curve. 

3-61. Work Prob. 3-60 for the duobinary waveform. 

3-62. A modified duobinary waveform uses the pulse of Prob. 3-60. Find and plot 
the waveform’s power spectrum and compare to that in Fig. 3.7-3 for rectangular 
pulses. 

3-63. A bit interleaved multiplexer is to be designed to operate on a T1 line. Its 
output bit rate, therefore, is 1.544 megabits/s. The multiplexer’s inputs are 
all at the same average rate of 32 kilobits/s. (a) If the design is required to 
output 40 data bits for every control bit, applicable to each input channel 
(so M d = 40 M c ), find the number of input channels that will result in the 
smallest average number of stuff bits per control bit per channel, (b) What 
is this smallest stuff ratio? 


Chapter 4 

Baseband Digital 
Systems 


4.0 INTRODUCTION 

In Chap 3 most of the blocks involved in a typical communication system, 
as muTtrated in Fig. 1.3-1, were discussed. In this chapter we conOnue 
the discussions by concentrating mainly on the operation of receiver 

dCm °The demodulator is a critical system element, for it is ; here that the 
principal performance capability of the system is established It is the 
demodulator that examines the input stream of channel puhes wluch are 
partially obscured by channel noise, and determines which code digits are 
being conveyed in the received waveform. Such determinations always 
involve occasional errors because of the noise, and we usually seektofind 
optimum demodulators that minimize the probability of occurrence of errors. 

P Tthis chapter several forms of baseband digital systems are discussed. 
Some of these forms do not involve channel encoding or decoding and a 
capable of directly converting an analog message to a digital waveform for 
transmission. Delta modulation (DM), delta-sigma &-SU) k and adaptive 
delta modulation (ADM) systems have this form. As "° ted 1 “ 
these systems typically operate with sources having outputs that are correlated 
do, tto.fore, represent a class of source encode, not 

considered ™ “ a P f ba<iCband sysKms l0 be discussed, such as P“ lss “‘*' 
modulation (PCM) and M - ary systems, utilize the quantizing and encoding 
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functions of Chap. 3. In our discussions of these, the word system will 
refer primarily to the demodulation operation. 

It is appropriate to begin by defining an optimum binary digital system, 
since we usually attempt to realize near optimum results in practice. This 
initial work is relatively mathematical but should still be rewarding to those 
readers interested in the details of how such systems are optimized. Readers 
more concerned with applications of results and having less interest in 
theoretical developments can omit the initial work and go directly to Sec. 
4.4 but must be careful not to violate the assumptions and limitations that 
apply to the results. 

4.1 REQUIREMENTS AND MODELS FOR SYSTEM 
OPTIMIZATION 

The proper selection of an optimum system requires that at least three 
quantities be defined [1]: 

1. System constraints, 

2. Input specification, 

3. Optimization criterion. 

System constraints vary widely. The system might be constrained, 
for example, only to the class of linear physically realizable systems. In 
Other cases realizability may be of no concern or a nonlinear system might 
be acceptable. In the present case we shall require only that the optimum 
system be linear. Even though a nonrealizable system can result, it can 
always be approximated in practice and it can be used as a basis of performance 
comparison for other designs. 

Input specification requires that at least some knowledge be available 
about the receiver’s input waveform, which is usually the sum of a desired 
signal and undesired noise. As a rough rule of thumb, the more that is 
known about the inputs to the receiver, the more optimum a system can 
be in some sense. 

The optimization criterion is a measure of the goodness of a system. 
It is usually a quantity that can be either maximized or minimized; the 
optimum system then becomes that which produces the maximum or minimum 
measure. One type of optimum system is discussed in Appendix B and is 
called a matched filter. It is selected to maximize the ratio of output peak 
signal power, at some specified instant in time, to average noise power. 
The optimization criterion is therefore to maximize output peak signal-to- 
noise ratio. 

For the present problem we require the optimum system to be linear. 
Input specification and optimization criterion are defined in detail in the 
following subsections. 
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Basle Binary Systam 

Figure 4 1-1 is helpful in defining the overall system to be optimized. 
At the transmitter m represents a binary digital message sequence that can 
have either of two possible values in any one symbol interval They are 
m = m x if a binary 0 occurs, orm = m 2 if a 1 is transmitted. The symbo 
duration is denoted by T b , as usual, and is the same for all intervals of the 
sequence. In general, the corresponding transmitted waveform is represented 
by s(t) However, in any one symbol interval s(t) will always correspond 
to either the signal *,(. t) when m = m, or 5,(0 when m = Both 5,(0 
and 5 ,(f) are assumed arbitrary, except that they are real and nonzero only 
in the symbol intervals in which they are transmitted; that is, their duration 
is L, The receiver (demodulator) is assumed to know the forms of 5,(0 
and 5,(0, but it does not know which is transmitted in a given symbol 
interval because it has no knowledge of which message (m, or m 2 ) has been 
originated by the source. 



Figure 4.1-1. Basic binary digital system. 


Noise Model 

The input waveform to the receiver, denoted generally by K0> > s 
assumed to be the sum of the transmitted signal and white, zero-mean, 
Gaussian noise, denoted by n w (t). The noise power density spectrum is 
denoted by Jf 0 /2 as usual, and applies to frequencies -» < ® 

It is helpful to think of the white noise as resulting from the limit of 
ideally bandlimited white noise as its bandwidth becomes infini e. 
bandlimited noise, denoted by n b (t), has a power spectrum 

y„(«> - (f) r “(^] <4 ' w> 

where W N is its bandwidth (radians per second). By inverse Fourier trans- 
formation of Sfjw) the corresponding autocorrelation function l 

fU T ) = ^Sa(VV„T). (4-1-2) 

Figure 4.1-2 illustrates the behavior of ^ nft (®) and f?m,( T ). 


i 

! 

: 

i 
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(b) 

Figure 4.1-2. (a) Power spectrum and (b) autocorrelation function of band- 
limited white Gaussian noise. 


Samples of n b (t) taken A T apart at times 


t k = kAT, k = 1,2,.. 

are uncorrelated if A T is chosen as 

.,K 

(4.1-3) 

t> 

II 


(4.1-4) 


because the noise has zero mean and its autocorrelation is zero for sample 
time separations that are multiples of tt/W n . Furthermore, since the noise 
is assumed Gaussian, these samples are statistically independent. If the 
variance of noise samples is denoted by cr 2 , then cr 2 is given by 

o’ 2 = E [n 2 b (t)i = RJ0) = (4.1-5) 

Alternatively, 

(7 2 AT = y. (4.1-6) 

Next, we observe that the joint probability density function of all K 
noise samples, denoted by p„ 4 (n,, n 2 , . . . , n K ), is the product of the 
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densities of the individual noise samples, denoted by p nk (n k ), because samples 
are independent. Therefore, because samples are Gaussian, we have 

pj/ti, n 2 , ..., n K ) = FI PnMk) = ex p[- 2 (4J ' 7) 

The above noise model constitutes our specification of the receiver’s 
input noise. Choice of this noise model greatly facilitates the problem of 
finding optimum receivers. If the K noise samples occur during one symbol 
interval T b , then 

K = It = (4.1-8) 

AT TT 

independent noise samples are available. As W, v — * 00 such that n b (t) — » 
n w {t), white noise, the number of samples becomes infinite and AT -> 0. 
Later we shall find that K->°° allows some summations to be approximated 
as integrals that yield the optimum systems, which, of course, is why the 
noise model is used. 


Signal Model 

Let r(t), and s 2 (t), evaluated at sample times /*, k = 1,2,..., 
K, be defined as follows 

r k - r(t k ) (4.1-9) 

s lk = Si(t k ) (4.1-10) 

5a = s 2 (t k ). (4.1-11) 

Here {r*} is a set of random variables representing samples of the receiver’s 

input waveform. They are random because of the channel noise. 


|$ u + «*, if m t sent 
I s 2k + n k , if m 2 sent. 


(4.1-12) 


Thus n k = r k - s ik , i = 1 or 2. The joint probability density of the random 
variables {/-*}, conditional on m, or m 2 being sent, is 

K 

pM i > r 2 , ..., r f:\tn,) — Pmk r k — s ik) 


= (2wV" /2 exp{-|^^},i= 1,2. 


v (4.1-13) 

This discussion constitutes the specification of the input signal. It 
remains to define the criterion of optimality. 
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Optimization Criterion 

At appropriate times, once for each symbol interval, we shall require 
that the receiver demodulator decide which waveform, s,(r) or s 2 (t), was 
transmitted, t Of course, such decisions amount to deciding which of the 
two binary messages, m, or m 2 , was transmitted. Thus the receiver is 
basically a decision device; if its decision is optimum in a typical interval, 
it will be optimum for other intervals as well. This fact allows us to examine 
the optimization problem for only one (typical) symbol interval. For con- 
venience, we assume the interval exists from t = 0 to t = T h , 

The receiver can only work with its input waveform, rU). Its decisions 
must be made on the basis of observations of r(t). We shall assume these 
observations are in the form of samples taken closely in time, AT apart. 
As the number of samples becomes very large (— ► t») and AT becomes 
very small (— > 0), the samples are equivalent to observing the continuous 
waveform r(t). 

Let K samples be taken at times t k - kAT, k = 1, 2, . . . , K, where 
T b = KAT. We choose AT according to (4.1-4). These choices allow the 
noise and signal models described above to apply. Now suppose the receiver 
observes a set of specific values, denoted by {p k }, of the sample random 
variables {r*} during a given symbol interval. We seek to find how a receiver 
should process these values to optimally decide which message was trans- 
mitted. Next, suppose the receiver could somehow calculate two probabilities: 
first, the probability that message m, was sent given that the specific set 
of observations {p k } was obtained and second, the analogous probability 
that m 2 was sent.i Denote these probabilities by P(mi\p u . . . , p K ) and 

Pim 2 \pi p K ), respectively. They are called a posteriori probabilities. 

An obviously reasonable decision as to whether m, or m 2 was transmitted 
is: If 

^("i 2 |pi Pk) > Tfmjp, , ..., p A -), choose m 2 , (4.1-14) 

and choose m , otherwise. 

In fact, (4.1-14) becomes our optimization criterion. The system that satisfies 
(4.1-14) is the optimum system. 

Our criterion can be put in a more convenient form by observing that 
[see (B.l-3), Bayes’ rule] 

P(m,\p u ..., p K )P(p u ..., p K ) = i P(p,, ..., p K \m,) Pirrii) (4.1-15) 
for i = 1 and 2, where P(m ( ) is the probability that message m, was transmitted. 

t This is called bit-by-bit signaling. 

} The receiver does not actually compute these probabilities; it is only helpful to imagine 
it does. The actual receiver is equivalent to having made such calculations. 
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Also, since 

F(p,, .... PatK) = PriPi, • ••> PxK) dp x dp 2 ■■■ dp K (4.1-16) 
for i = 1 and 2, we can write the optimality criterion as 


Pr (Pi , -.PkN fQ»l) 
Pr(Pi, ....PatIw,) F(m 2 )’ 

and choose m, otherwise. 


choose m 2 , 


(4.1-17) 


4.2 OPTIMUM BINARY SYSTEMS 

The optimum binary system is one that satisfies the decision rule of 
(4.1-17). Define message probabilities (called a priori or source prob- 
abilities) by 

P x = P(nti) (4.2-1) 

P 2 = P(m 2 ) (4.2-2) 

and substitute (4.1-13) into (4.1-17). The inequality becomes 


tel. 


[2 Pk( s 2k s l/k) S 2k + S lk\ 


H 


(4.2-3) 


Because the exponential is monotonic in its exponent, we can take the 
natural logarithm in (4.2-3) to obtain an equivalent inequality: 


X p(kAT) [s 2 (kAT) - s x (kAT)]AT - X [^i(*AT) 

*-i *-> 

- yftfcAT)] > ° JA7 ' ln (p^) ’ 

where we substituted 

Pk = P(h) = P(MT) 
su = si(t*) = s x {kAT) 

Szk = s 2 (tk) = s 2 (kAT). 

In the limit as K bcomes large and AT becomes small, 

A T-kdt 
kAT -► t, 

(4.2-4) can be written as 

r n .... £,-£,+ jV 0 ln(P,/P 2 ) 

J r(t) [s 2 (0 - Ji(/)] dt > • 

Here _ Tb 

Ei = I s-(t) dt, i = 1 and 2, 


(4.2-4) 

(4.2-5) 

(4.2-6) 

(4.2-7) 

(4.2-8) 

(4.2-9) 

(4.2-10) 
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are the energies in signals $,(/). In writing (4.2-10) we have used (4.1-6) 
and replaced p(t) by r(t), since pit) is only a specific realization of rit). 
Equation (4.2-10) then represents the general operation on r(t) that is performed 
by the optimum receiver. 

By defining a threshold, V T , as 



(4.2-12) 


(4.2-10) allows the system’s decision rule to be written as: 


r 

If I r(/)[i 2 (0 - $i(/)] dt > V T , choose m 2 , 
-'o 

and choose m, otherwise. 


(4.2-13) 


Correlation Receiver Implementation 

Since the receiver knows the forms of the waveforms s,(r) and s 2 (t) 
that are used by the transmitter, it can generate these waveforms and 
implement the test of (4.2-13), as shown in Fig. 4.2-1. The input waveform 
in a given symbol interval is multiplied by replicas of the two waveforms 
Si(t) and i 2 (/) and integrated over the symbol interval. The threshold V T 
is subtracted from the difference of the integrals and the resulting signal is 
sampled at the end of the interval. If this sample, denoted by D, is greater 
than 0, the receiver decides m 2 was transmitted. If D < 0, it decides in 
favor of m u and if D = 0 a random choice between /n, and m 2 is made. 
At the end of each symbol interval the integrator is discharged (reset to 
zero) in preparation for integration over a new interval of duration T b . The 
process is repeated in each symbol interval. t 


Matched Filter Implementation 


Consider the responses of filters matched to $,(/), i = 1 and 2, when 
r(t) is applied at their inputs. The matched filters’ impulse responses are 
[see Sec. B.9]^ 

h,(t) = s,(T„ - 0, i = 1 and 2, (4.2-14) 


where j,(/) are nonzero only for 0 =£ / =s T b . Filter responses, denoted by 
r oi (t), are 


r oi (t) = I r (0 h,it - {) d£ 


(4.2-15) 


r(t)Si(T b - t + t)dt, i = 1 and 2. 


t Note that symbol synchronization is necessary so that samples are properly timed 
in the optimum system. 

t The constant r 0 in Sec. B.9 has been set equal to T b to make h,U) causal. 


ISO 
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r o2 (‘) = s o 2 < ( ) + «oj(0 


r 0 (t) = s 0 (t) + n B (t) 



£, f D > O, Choose m 2 

►- i D - 0, Random Choice 

LD<0, Choose m, 


' T Sample 
at t = T h 


r o\ (0 = s 0 i W + "„i (0 

Figure 4.2-1. Optimum binary digital demodulator and decision in correlation 
form. 

At the output’s sample time t = T b , 

r ol (T b ) = f r(0s,(£) d£, i = 1 and 2, (4.2-16) 

which are the same as the integrator outputs of Fig. 4.2-1 . Thus we conclude 
that the product-integrator cascades are equivalent to filters having the 
impulse responses of (4. 2-14). t These matched filters are also defined by 
their transfer functions 

H,(o)) = S*(w)e~ J “ Tb , i = 1 and 2, (4.2-17) 

where 

s,(t) *-* S,(o>), i = 1 and 2. (4.2-18) 

Figure 4.2-2 illustrates the optimum receiver, using matched filters, that is 
equivalent to the correlation receiver of Fig. 4.2-1. 

Optimum System Output Noise Power 

Let n 0 {t) represent the noise of the optimum receiver’s output, as 
shown in Fig. 4.2-1. If n a is the random variable representing w„(0 at the 
sample (decision) time T b , then 

n a = n 0 (T„) = f njt) [s 2 (t) - 5,(0] dt. (4.2-19) 

-'O 


t The reader should note that the matched filter and product-integrator (correlation) 
receivers are equivalent only at the sample time at the end of the bit interval. 
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Filter Matched 




r„(t) = s a U) + «„(r) 
1 + \ > 


D > 0, Choose m 2 

D = 0, Random Choice 

D< 0, Choose m. 


H. fro') 





Filter Matched 
to 5,(0 


r ol (t) = s ol (t) + n ol (t) 


Figure 4.2-2. Optimum binary digital demodulator and decision in matched filter 
form. 


The output noise power (variance) on the decision sample, denoted by 
<rl, is 


aj = E[n 2 0 ] = E {f o n Ml 

, r r~ 

Jo Jo 2 


r 

I n w \ 

Jo 


(0fo(0 - •SiW] dt I n w (a) [s 2 (o:) - 5,(a:)] da 


8(a - f)[i 2 (0 - ■Si(0][i 2 (a) - .v,(a)] dadt 


r 

2 J 0 


[s|(0 + s ?(/) - 25,(0 s 2 (0] dt. 


(4.2-20) 


The first two terms are the energies E, and E 2 in 5,(0 and s 2 (t) given by 

E, = f 5?(0 dt (4.2-21) 


« / 0 

E 2 = f sftt) dt. 

Jo 


(4.2-22) 


If we define a quantity y, which is a measure of the correlation between 
j,(0 and 5 2 (0, by 

1 r n 

y = -f= / 5,(0i 2 (0 dt, (4.2-23) 

\/E,E-> Jo 


\E x Ei Jo 

then the output noise power of the optimum system becomes 


2 _ *0 

° ~ It 


[£, + E 2 - 2y\/E ,£ 2 ] . 


(4.2-24) 
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Optimum System Output Signal Levels 

Let V, and V 2 represent the signal component, s 0 (t), in the output of 
the optimum system of Fig. 4.2-1 at the sample time t = T b when m, and 
m 2 are transmitted, respectively. These are 


V 2 = s a (T b ) 


V, A So (T„) 


= f s 2 (t)[s 2 (t) - i,(t)] dt 
= E 2 - y\/Efi 2 , m 2 sent 

= f Si(t)[s 2 (t) - Si(0] dt 

-'O 

= y\fE x E 2 - m x sent, 


(4.2-25) 


(4.2-26) 


where (4.2-21)-(4.2-23) have been used. These signal levels, taken at the 
sample time, are needed in computing error probabilities. 


4.3 OPTIMUM BINARY SYSTEM ERROR PROBABILITIES 

Because of noise, the output r 0 (i) of the optimum receiver is random. If 
message m x is transmitted, this output at the sample time t = T b has a 
Gaussian probability density with mean value V x . Transmission of message 
m 2 leads to another Gaussian density, except this one is centered at a mean 
value V 2 . These densities are illustrated in Fig. 4.3-1 where p, 0 {r„\m,), i = 

1 and 2, is the density of r a = r a (T b ) given m, is transmitted. It follows 
that 

P*(r>/) = pjr a - V,), / = 1 and 2, (4.3-1) 

where p„„i ) * s the probability density function of the output (Gaussian, 
zero-mean) noise having variance <r\ given by (4.2-24). 

The receiver makes decisions by comparing the output r„ with V T \ if 



Figure 4.3-1. Probability density functions of optimum system output at the 
decision (sample) time for transmission of the two messages m, and m 2 . 
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fo > V T , it decides m 2 was sent. However, if r 0 <V T and m 2 is sent, an 
error is made; its probability is 


rv T 

P(e\m 2 ) = J PrS- r o\m 2 ) dr a 


(4.3-2) 


Pjr a - V 2 ) dr a . 


In a similar manner, if r a > V T and m, is the correct message, an error is 
made having probability 


’ll) = / pjrjw,) dr 0 

~'Vt 


(4.3-3) 


PnS r o ~ V x )dr a . 


By averaging over the two possible errors, we obtain the average probability 
of error, denoted by P e , for a given symbol interval (one bit in this case). 
After using the fact that output noise is Gaussian with variance crl , P e can 
be written as 

P e = P(e\m x ) P , + P{e\m 2 )P 2 


= |erfc 


V2 (r 0 ) 2 V V2 cr 0 / 


(4.3-4) 


where erfc(-) is the complementary error function of (B.4-6). On use of 
(4 . 2-24)— (4 . 2-26) and (4.2-12) we obtain 


Pe = ~ erfcl / E ' + El - 2y \/e x E 2 yCv^ \n(P l /P 2 ) ) 

4Ji ° 2 Ve, + E 2 - 2y\/EE 2 1 

+ ^ erfcl / £ - + E 2 - 2 yV^E 2 V^ln (P x /P 2 ) ] 

4J{ ° 2 Vf, + E 2 - 2y\/EE 2 J 

(4.3-5) 

This expression for P e is valid for arbitrary waveforms 5 ,(t) and s 2 (t) and 
arbitrary choices of P t and P 2 . Some simpler expressions follow some 
realistic assumptions, often true in practice. We take three cases. 

Equal Probability Massages 

If P x = P 2 = i, we have 

P* = \ erfej j. 


(4.3-6) 
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Because (4.3-6) still applies for arbitrary waveforms s,(t) and s 2 (f), it can 
be further minimized if we sacrifice this generality by selecting one, say 
j 2 (f), to be a suitable function of the other waveform Si(t). Since erfc(-) is 
a monotonic function of its argument, P e is a minimum when 

£, + E 2 - 2y Ve^E 2 = £, + E 2 - 2 | s x (t)s 2 (t) dt (4.3-7) 

-'O 

is maximum. Suppose we maximize (4.3-7) under a constraint of constant 
total energy 

£ T0X = £, + E 2 = constant. (4.3-8) 

The maximum will occur when the third (integral) term is maximum. From 
Schwarz’s inequality 

12 r Tt 

.2 


2 f Si(t)s 2 (t)dt 

Jq 


(4.3-9) 

(4.3-10) 

(4.3-11) 

(4.3-12) 


=s4 f s](t)dt f s 2 (t) dt = 4E X E 2 . 

Jo Jo 

The equality holds only when 

s 2 (t) = Ts^r), 

with T a real constant. Squaring and integrating (4.3-10) gives 

r-M 

From use of (4.3-10) and (4.3-11), we write (4.3-7) as 

J -r* 

s,(t)5 2 (t) dt = £ tot •+• 2 \JE x E 2 

0 

= T^tot 2 \/E { (E lm — £,). 

The derivative of (4.3-12) with respect to E x is zero for E x = E T0T /2. The 
second derivative of (4.3-12) with respect to £, is negative only for the 
lower signs in (4.3-12) when E { = E lm /2. Thus we conclude that (4.3-7) 
is maximum only when 

E, = £ 2 = (4-3-13) 

s 2 (0 = r j,(/) = - *,(/)• (4-3-14) 

It is left for the reader as a simple exercise to show that the above 
developments are equivalent to the maximum of (4.3-7) occurring when 
E t = E 2 = £ T ot/2 and y = - 1 . Our developments lead to two additional 
cases. 

Equal Probability, Equal Energy Signals 

Define £ as the energy of each signal with 

£ 4 £, = e 2 = 


(4.3-15) 
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P e of (4.3-6) reduces to 

/£(! - y) 
V 2jV 0 
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Equal Probability, Antipodal Signals 


When s 2 (t) = -s x {t), the signals are called antipodal. 
leads to the minimum value of (4.3-6) as discussed above: 



This choice 


(4.3-17) 


Equation (4.3-17) is also obtainable from (4.3-16) by letting y = - 1. 


4.4 BINARY PULSE CODE MODULATION 

Binary pulse code modulation (PCM) is the name used to describe the 
process by which an analog message is converted to a binary digital waveform. 
The process consists of first sapling the analog signal according to the 
principles described in Chap. 2 so that the analog message can be recovered 
from its samples in the receiver. Each sample is then quantized to the 
nearest of a finite number of quantum levels and coded (often using the 
natural binary code). Finally, a waveform format is assigned to the code. 
The resulting PCM waveform is processed through the digital system. In 
this section we shall describe the basic functions performed by a PCM 
system and discuss its performance with several example waveform formats 
based on rectangular pulses. 

Overall PCM System 

Many ways exist for implementing PCM systems. Figure 4.4-1 illustrates 
one method that assumes N similar analog messages are time multiplexed. 
Each message is lowpass filtered to assure bandlimiting and is then sampled 
to generate N time-multiplexed sampled waveforms, each in its assigned 
time slot. The composite signal is compressed, if companding is used. 
Samples are next quantized, source and channel encoded, and finally converted 
into appropriate waveforms by the modulator. Although the transmitted 
waveform can have any of the formats discussed in Chap. 3, we shall limit 
our discussions to only unipolar, polar, Manchester, and differential formats, 
because these waveforms all have optimum receivers easily put in the form 
of those developed in Sec. 4.2. 

Naturally, many variations are possible for the transmission system 
of Fig. 4.4-1. In some cases the channel encoder, the compressor, or both 
can be left out. If the composite message source is digital, such as one 
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with only discrete levels, for instance, it can be handled by accepting it as 
an input at point A. Digital sources already source encoded can be input 
directly to point B. By suitable multiplexing, combinations of these types 
of inputs can be handled. 

The receiver necessary to recover messages from the PCM signal 
basically performs the inverse operations of the transmitter. Figure 4.4-2 
illustrates the receiver based on the transmitter of Fig. 4.4-1. The receiver 
becomes optimum when the demodulator has the forms developed in Sec. 
4.2. 

Unipolar, Polar, and Manchaatar Formats 


For example, a unipolar format corresponds to s,(t) = 0, s 2 (t) = A 
for 0 *£ / « T b and the optimum demodulator of Fig. 4.2-1 reduces to that 
shown in Fig. 4.4-3(a). From (4.2-12) the optimum threshold V T is 



unipolar format, 


(4.4-1) 


since the energy, E 2 , in s 2 (t) is 

E 2 = A 2 T„ (4.4-2) 

for the assumed rectangular pulses. 

For a polar format where s,(t) = -A and s 2 (t) = A, both for 0 *£ 
t T b , the optimum demodulator has the form shown in Fig. 4.4-3(b), 
where 



polar format. 


(4.4-3) 


E x =E 2 = A 2 T„. 


(4.4-4) 


When the two binary symbols are equally probable, the threshold is zero. 
The polar waveform is a special case of antipodal signaling. 

With a Manchester waveform the transmitted waveforms s,(t) and s 2 (t) 
are also antipodal. Here s 2 {t) is easily generated from a periodic square 
wave clock that generates a level A for 0 « t < TJ2 and -A for T b /2 < 
t 7*. Its fundamental frequency is, therefore, w b = 2n/T b . By inversion, 
the clock generates s,(t) = -s 2 (t) as well. The optimum demodulator 
becomes that shown in Fig. 4.4-3(c) where V T is given by (4.4-3). In a 
practical realization a single integrator could be placed after the left summing 
junction for a hardware savings. 

All the optimum demodulators of Fig. 4.4-3 have equivalent filter 
realizations that are easily derived from Fig. 4.2-2. 


Error ProbabilitiM 


The error probabilities of Sec. 4.3 apply to the demodulators of Fig. 
4.4-3 because they are optimum. If we define s as the average energy per 
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Figure 4.4-3. Optimum demodulators for (a) unipolar, (b) polar, and (c) Manchester 
waveforms. 


symbol (bit) interval divided by twice the channel’s white noise power 
density [2(JV 0 /2)], then 


P 2 E P 2 A 2 T h 

unipolar format 


bf , => 

II 

polar format 

(4.4-5) 

ii 

Manchester format 



where E is the energy in a transmitted pulse of duration T b and amplitude 
A. From (4.3-5), the average probability, P e , of incorrectly decoding a bit 
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(symbol) becomes 



unipolar 

format 


(4.4-6) 


polar or 

Manchester 

format. 





e in dB 


I 

! 


4 . 4 . 4 . E™, pro»W <or »• “ 4 

unequal message probabilities. 
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the polar and Manchester cases P e is not a sensitive function of P JP for 
0.1 =s P\/Pi « 10 and does not change greatly even over 0.01 =s 
P l /P 2 s 100. For example, with P\/P 2 — 100 and P e = 10 5 , the increase 
in power required, relative to the P\/P 2 = 1 case, is less than 1.0 dB. The 
difference becomes even smaller for smaller values of P e . However, P t 
for P\ / P 2 is always smaller than when P\ = P 2 = i ■ This behavior does 
not occur in the unipolar case where one waveform’s energy is zero. For 
the often- satisfied case where P, — P 2 = i, we obtain 



unipolar format 

polar or Manchester format. 


(4.4-7) 


These two functions are plotted in Fig. 5.12-1 for comparison with the 
performance of various systems. For a given value of P e the required value 
of e is 3 dB larger for the unipolar format compared to the polar or Manchester 
format. 


Effect of Differential Coding 

If the binary source is differentially encoded, any of the binary waveform 
formats can be used, and the receiver is selected to match the format. For 
unipolar, polar, or Manchester formats, this means that either (4.4-6) or 
(4.4-7) still applies for the average probability of making an error in de- 
modulating any given bit. However, because 2 bits (symbol intervals) are 
involved in a differential decoder (Fig. 3.6-3(c)), the decoding function 
degrades overall error probability. 

Consider the decoder. First, suppose the currently arriving bit is 
correct; an error in output is made only if the adjacent earlier bit was in 
error. The probability of this joint event occurring is P f (l - P e ). Next, 
suppose the current bit is in error; the output is in error only if the adjacent 
earlier bit is correct, t Again this joint event has probability F e (l - P e )- 
The overall probability of a decoded bit being in error, denoted by 

-Fe(diff) ' IS 

P e(i>m = 2P'(l - P e ). (4-4-8) 

If P e is small, 

P etdifn ~ 2 . (4.4-9) 

The approximate effect of differentially encoding data in a PCM system is, 
therefore, a doubling of the probability of bit errors. This increase in error 
probability is considered small in most applications and the advantages of 
differentially encoded data more than justify its use. 


t Both bits in error will result in a correct output. 
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Finite Channel Bandwidth 

The transmitted rectangular pulses arrive at the receiver undistorted 
only if the channel has infinite bandwidth. In a practical system a typical 
rectangular transmitted pulse of duration T b will arrive at the receiver with 
nonzero rise and fall times. Because of the fall time, the pulse will spill 
over into the following symbol interval. The spillover, when integrated by 
the receiver (Fig. 4.4-3), will cause a change in the output level at the 
interval’s sample time compared to its level without spillover. The rise 
time also affects the output level. Interference between symbol intervals 
in this fashion is termed intersymbol interference, a subject covered further 

in Sec. 4.6. , . , , , 

A rough idea of the effect of a real channel can be obtained by 

assuming it has a simple first-order lowpass transfer function 

*„(») - [i + (£)]"• 

where W ch is the channel’s 3-dB bandwidth (radians per second). Now 
consider a typical level transition in a polar PCM transmitted waveform, 
for example, from -A to A; the channel response is illustrated in Fig. 
4.4-5.t A straightforward calculation of D in Fig. 4.4-3(b), when assuming 
p, = p 2 = J and no noise, gives 



The ideal output is 2A 2 T b . The bracketed factor represents degradation 
due to finite channel bandwidth. If the degradation factor is to be maintained 

t For simplicity, level -A is assumed to have occurred for several symbol intervals 
so the transition starts at - A. 
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at 0.9, or better, W ch must not be less than 20/7), = 1 0o>*/7r = 3.2 w b . 
From Fig. 3.7-2 this bandwidth corresponds to passing two full sidelobes 
in the power spectrum of the polar PCM signal. In Sec. 4.6 we show that 
transmitted pulses can purposely be shaped to reduce both intersymbol 
interference and the need for large channel bandwidth. 


4.5 NOISE PERFORMANCE OF BINARY SYSTEMS 

When a binary system operates with analog signals (PCM) where the receiver 
must reconstruct the original continuous message, the measure of noise 
performance is the ratio of system output signal power to noise power. 
When the system operates with digital messages, it is the average probability 
of making errors that is most important. We consider this second situation 
first. 


Performance for Digital Messages with Coding 

In many cases a group of k binary digits may represent a “word” of 
data, such as with natural binary source encoding when k equals N b , the 
number of binary digits per word. Here no channel encoding is present, 
and we call this uncoded signaling. The probability of a word error, denoted 
by P w , may be more important than P e , the probability of a bit error. A 
word error occurs if any one or more of the k bits are in error; this 
probability is 1 minus the probability that all bits are demodulated correctly, 
as given by 

P„ = 1 - (1 ~ Pf (4.5-1) 

= kP r , P e « 1. 

In the case where the data word is coded using, for example, a 
r-error correcting (n, k) code, no error is made unless t + 1 or more of 
the n total bits in a word are in error. Probabilities involved here follow 
the binomial distribution. The probability of exactly i bit errors in n bits 
(any order) is (?) P‘ e { 1 - PA" - ', so the probability of t + 1 or more bit 
errors becomes 


P„= 1 


P‘( 1 - PeT 


(4.5-2) 


n\P'r 

(t + 1)! (n - t 


Of course, P e here is the average probability of any one bit being in error.) 

t Equation (4.5-2) approximates the sum by the first term. However, if n is large it 
may be necessary to include additional terms for the approximation to be accurate. 
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p t i n (4.5-2) does not necessarily have the same numerical value as P € of 

(4 5 To compare coded and uncoded systems, assume both use the same 
total time allocated to a codeword and the words have the same number 
(k) of data bits. These assumptions mean that both T„ and e in the coded 
system are smaller than T„ and e in the uncoded system, respectively, 
because the coded system must transmit its extra n k check digits. In 
the two systems P w follows from (4.5-1) and (4.5-2) by using the appropriate 
s (see (4.4-5)) with the appropriate expression for P t . For a polar tormat 
with equal word probabilities, for example, we obtain 

p w = (^j erfc(Ve), no coding (4.5-3) 

p _Hl i erfc( /— j , with coding. (4.5-4) 

w (t + l)!(n - t - 1)! L 2 \V « / J 

Here e refers to the uncoded system and is assumed larger than about 4_ 
System comparison usually entails plotting P„ versus e for given n, k, and 
t. The difference in required values of e for equal values of P w is called 
coding gain. For large enough e, coding gain is positive (e required in 
coded system is less than e required in uncoded system). For small enough 
e , coding gain can be negative so that coding actually degrades performance. 
To illustrate these points we conside r an example. 

A^now^ perfect code is the (23, 12) Golay code, where t ■ - : 3 [3] . We 
F to evaluate the complementary error functions of (4.5-3) and (4.5-4). Figure 
1.5-1 Plots the word probabilities. At P „ = 1<T« the coding gam is approximately 

2.6 dB. _ 


Performance for Analog Messages Above Threshold 
(PCM) 


Because of quantization, only the quantized version /,(/) of an analog 
message, /(f), can be recovered by the receiver, even if there is no noise. 
As in Chap. 3 we represent///) as /(f) with quantization error e q (t) as 

/,(/) = fit) + e/f). (4 ' 5 ' 5) 

When noise is negligible, the system is said to be above threshold and 
performance is measured by the signal-to-quantization-noise power ratio 


X\ _ fw 

J PCM e 2 q(t) 


(4.5-6) 


We determined (SjN q ) in Chap. 3 for several forms of ^ 

shall not repeat those results here. However, as an example we summarize 



Figure 4.5-1. Word error probabilities for coded and uncoded systems. 


one important case. For messages that have symmetrical fluctuations about 
zero and have an extreme value, denoted by |/(f)| max . a uniform quantizer 
having a large number L of levels produces 

=(^)4M-. (4.5-7) 

\nJ \K l 


Here/ ?(/) is the message’s power when at the reference level that establishes 
the quantizer’s step size, Su, according to 

Sv = (4.5-8) 

The message’s crest factor is K cr , defined by 

„ 2 _ |/(0lLx 

/V) 


(4.5-9) 
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Equation (4.5-7) applies for 7*0) when TU) > JW), amplitude overload 

occurs (Chap. 3) a nd (5 0 /N,) PC m decreases. 

We'calculate (4.5-7) for a sinusoidal message of peak amplitude 2.7 V *n > 
128-level uniform quantizer is designed for a maximum amplitude of 3.. V. Here 
TiTTj = (2 7) 2 /2 and fW) = (3.2) 2 /2, whereas K] r = 2 for sinusoidal signals. Thus 
(SJN'h cm = 13(128)72] (2.7/3. 2) 2 = 17,496.0 (or 42.4 dB). 


Effect of Receiver Noise in PCM 


When receiver^ noise is not negligible, we may write the recovered 
signal, denoted by fff), as 

Ut) = m + = m + (4 - 5_10) 

In other words there is now an error e„(r) in the reconstructed message 
/ (f) due to noise. Because the quantization and receiver e, /ors anse from 
different mechanisms, they may be taken as statistical y in epe 
the total output waveform power is the sum of mdivi ua pow 
Define quantization and receiver error noise powers by 

N, = Ofi) (4.5-11) 

N rec = ^(0. (4 ' 5 ' 12) 

These allow output signal-to-noise power ratio to be defined by 


X\ = ^0 

flo) VCM + 


(4.5-13) 


The presence of N„ c gives rise to a threshold effect in PCM. 

Ou“£° problem b to ted J*«. To this end. we must relate mpot 
noise to how it causes errors in the output. The relationship is found from 
a close look at the behavior of the demodulation and decoding operations 
of Fig 4 4-2. At the end of each word, the source decoder generates an 
exact quantum level corresponding to the recovered codeword (we consider 
only the case where no channel encoding is used). The recovered codeword 
and therefore /(f), will contain error only if an error is made in indi vidua 
“to “Sons The effect of r.cetver noise is to cause oecastonal false 

“'“thus will possess a noise error if a codeword is in error -. add a 
codeword is in error if any one or more bit errors are made. Most PCM 
systems operate such that it is highly unlikely that more than one bit is in 
errorTn any one word. By assuming this to be the case, we examine the 

effect of bit errors on a bit-by-bit basis for an Ni,-bit wor . 

If an error occurs in the least significant bit, an error in ffi) of 8u 
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occurs. If such an error occurs in m words out of M possible words the 
average (mean-)squared error is (Svfm/M. For a large number of words 
m/M is interpreted as the probability P e of the least significant bit’s being 
in error. Average-squared error in f q (t) is then (Sv) 2 P e . 

For the next least significant bit, a bit error causes an error in /,(/) 
of 2(8u). The mean-squared error becomes (2bv) 2 P e . Continuing the logic 
to the ith next least significant bit, the mean-squared error in /(f) is 
(2 1 &v) 2 P e . Now, recognizing that the total mean-squared error e 2 (7 in 
f q {t) is the sum of the contributions from each bit in the code word, we 
have 

Nb-l *2Nb _ i 

N rec = e 2 n (t) = (fv) 2 P e X (2/ = (Sv) 2 P e r . (4.5-14) 

(=0 3 

Finally, since 2 1Ni » 1, we substitute (4.5-8) and obtain approximately 


N rec = 


4j/r(f)|max 


(4.5-15) 


To complete the calculation of N m , we may substitute the expression for 
P e given by (4.4-7). For polar PCM with P, = P 2 = \ we get 


2|/Xf)|Lx2 2 ^ 

Nrec = 3L 2 



(4.5-16) 


A similar result is achieved for unipolar PCM. 


Performance Near Threshold in PCM 

From (4.5-13), using (3.2-4) with N q , 0 i = 0 since we assume f 2 (t) =£ 
/ 2 (0, (4.5-14), and the fact that S 0 = f 2 (t), we have 

polar NRZ PCM. (4.5-17) 


This expression can be put in a more useful form by observing that the 
demodulator of Fig. 4.4-3(b) applies with 37 = 0 from (4.4-3). The peak 
power in the signal component of the waveform at the sampler, at the 
sample time, denoted by 5, , is 

Si =V\=V\ = 4(A 2 T b f (4.5-18) 

from (4.2-25) or (4.2-26) because y = — 1 and E x = E 2 — A'T b . The 
average noise power at the input to the sampler, from (4.2-24), denoted by 
N„ is 



JV, = or l = 2^o A 2 T b . 


(4.5-19) 
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By defining sampler input peak signal-to-noise power ratio as 


(iL 

we may write (4.5-17) as 


2 A 2 T b 


polar NRZ PCM, 


(4.5-20) 


1 1 / M ] 

2 \Ni) pcm_ 


polar NRZ PCM. 


L\ 2 ( 4.5.2l) 

Of course, (S./N.W is Siven by (4.5-7). A 

Fi(T 4 5-2 for a message at its maximum no-overload level, J ( t) JrU 
The threshold effect is clearly evident and occurs for (SJN,) pcm approximately 
fr^m in to 15 dB for Nh = 2 to 10, respectively. 

Another useful form of (4.5-17) can also be obtained. We write 

A 2 T b _ A 2 N b T h /T s m t 2£SA Wl, (4.5-22) 

M a (Jf 0 W f /2ir) 2rrN b \X 0 Wf) 2ir ^- 

where the input average signal power per message to the receiver (from 
the channel), denoted by 5,, is 

c _ N *’ Tb (4.5-23) 

o, - j- ’ 

“<* r. is .b« 

written inTrms of a baseband system with the same average transmitted 
power, the same values of X 0 and W f , we have 

A 2 T b 1 / So] (4.5-24) 

2 N b \N, 


Hence (4.5-17) becomes 

(£) 

\N 0 J PCM 


{Sj NqjfCtA 


polar NRZ PCM. 


‘ + _ H5-75, 

Figure 4.5-3 illustrates plots of (4.5-25) using (4.5-7) when f\t) = 


t In general T + NJ b because many PCM signals may be multiplexed in time For 
similar^message^ushig the same word lengths and number of b.ts, there are N - TJN* 
time slots available in multiplexing. 
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Figure 4.5-2. Normalized performance of a binary polar NRZ PCM system when 
the analog message’s samples are encoded with N b - bit binary words. The abscissa 
is also equal to 2e in decibels [2]. 


Threshold Calculation 

The point of threshold will arbitrarily be defined as the point where 
( SJN 0 )pcm drops 1 dB below the value of (SJN Q ) ?C M - From (4.5-21) this 
occurs when the denominator becomes 1.26. Solving for the threshold 
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also equal to 2 N b e in decibels [2]. 


: 

i 


signal-to-noise ratio (S,/N,)pcM.f* §' ves 

ii) =2 

Nj/ pcM.l/i 

Figure 4.5-4 illustrates the behavior of (4.5-26). 


=rfc-'(P)J. polar NRZ PCM. <4.5-26) 



Figure 4.5-4. Threshold peak signal-to-noise ratio of a binary polar NRZ PCM 
system [2], 


4.6 INTERSYMBOL INTERFERENCE 

In Sec. 4.4 we found that insufficient channel bandwidth caused pulses 
arriving at the receiver to spread into adjacent pulse intervals. This spillover 
caused errors in the signal component of the waveform sampled in the 
adjacent interval at the symbol decision time. This effect was termed 
intersymbol interference. In this section we shall discuss ways of reducing 
intersymbol interference. 

Pulse Shaping to Reduce Intersymbol Interference 

Consider a typical case where N equal-length time slots are available 
during a frame time, denoted by T s . With N b binary bits in a time slot the 
duration of a bit interval is T b = TjNN b . If, at the time of any one sample, 
the voltage is entirely due to the symbol corresponding to the sample, there 
is no intersymbol interference. This situation corresponds to waveforms 
from all other transmitted symbols passing through a null at the sample 
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time However, since symbols are transmitted every TjNN b seconds, we 
must also require that the waveforms for all symbols pass through periodic 
nulls to prevent interference regardless of which sample is taken. 

One example of a theoretically suitable waveshape, having the form 
sinW/x was encountered in sampling. It corresponded to the response of 
an ideal lowpass channel when excited by very narrow (impulsive) transnutted 
Dulses To employ this waveform in the more general problem, 
product of the spectrum of the transmitted waveform, the channel transfer 
function, and the receiver matched filter would have to equal an ideal 
lowpass-type rectangular function of frequency. Such a result is not reahzaWe 
and is difficult to approximate in practice. Even if it could be realized 
there are still practical considerations in maintaining adequate synchromzation 
of sample point timing that render the sin(x)/x 

An alternative approach was developed in 1928 when H. Nyquist 
(Transactions oftheAIEE, Vol. 47, February, 1928, pp. 617-644) demonstrated 
how a pulsed signal having the desired distribution of nulls could be achieved 
while at the same time its spectrum magnitude approximated a realizable 
filter transfer function. Nyquist’s result has been called the vestigial-symmetry 
theorem [5] owing to the form of the spectrum Itm aY haveanyshap^ 
as illustrated in Fig. 4.6-l(a), so long as it is real and odd symmetn' exists 
about the points o> = ± W. Clearly, the form of the spectrum s one that 
may be approximated by a realistic system having a gradual roll-off char- 
acteristic It remains to show that the location of nulls is as claimed. 

Proofs of Nyquist’s vestigial-symmetry theorem usually follow the 
asumption W, ^ W, the case of usual practical interest. On making this 
assumption we may decompose the spectrum into the sum of a 

rectangular component and components with odd symmetry about ±W, as 
illustrated in Fig. 4.6-l(b) and (c). Inverse transforms of the components 
are easily shown to be 

W sin (Wt) (4.6-1) 

J -Wi 

,(*> + WO sin (cut) da>. (4.6-2) 

0 

The time function s 0 {t) corresponding to the spectrum SJ<o) becomes 

r Wx 

W sin (Wt) j 2 (I H x { ce + WO sin (cot) da) . (4.6-3) 

s ° (l> tt Wt l Jo -1 

Now regardless of the precise value of the term within the brackets, the 
factor sin (Wt)/Wt guarantees the existence of periodic nulls. Hence, fo 
almost arbitrary choice of H,(o»), intersymbol interference iszeroif symbols 
in the PCM pulse train are conveyed in pulses having the shape of s a ( ). 
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Figure 4.6-1 . Spectrums associated with Nyquist’ s vestigial-symmetry theorem, 
(a) A spectrum with the required symmetry, (b) its rectangular component, and 
(c) its components with odd symmetry about ±W [2]. 
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As an example, consider the raised-cosine spectrum defined by 
, \o*\ < W - W, 

s.(.) - 1 - i=os[^(W - W + w,>], w- w,«M« w + w, 


w+ W,<|o>|. 


(4.6-4) 


(4.6-5) 


Straightforward inverse Fourier transformation of (4.6-4) produces 

W sin (Wt) [ cos (W,f) (4.6-5) 

S ° (t) ~ 7 r Wt L 1 - (2W,t/rr) 2 . ' 

The above two functions are illustrated in Fig. 4.6-2 f or JV./M/ 0. 0-^ 

and 1.0. Several important observations may be madeabout thepulsesof 
(b) When WJW = 1.0, the sidelobes are very small (31.5 dB or more 

below the peak) indicating that intersy mho 1 mterterencecanbe madesma 
, _ • nrpwnfe 0 f timing errors. Furthermore, additional nulls occur 

at t = ±(2n + 1 )tt/2W, n = 1,2,... .which also > tend to 

that a polar signal constructed from this basic pulse will have possible nulls 
precisely tt/W seconds apart.t This is an advantage, since synchronization 
signals may be extracted from a received digital waveform having 

Pr ° P With pulse shaping we may still transmit code symbols one symbol 
per n/W seconds. In other words, our signaling rate ts W/ir. However, 
observe that the total band required with pulse shaping becomes ■ 

rather than W. Thus compared with an ideal stgnalmg rate o (W + WJ/ 
symbols per second, we suffer a reduction m rate by the factor W/(W + wo 

Partial Reapon.e Signaling For Interference Control 

Waveforms satisfying Nyquist’s vesng.al-sy mmetry theorem theoret- 
ically cause no intersymbol interference because o their per.od.c nul s 

iSLe to a U condition of comM interference, bandwidth .ncreases 
a " Tor 'example we coold select a symbol waveform so that it interferes 

'EM Z 

t Actual null locations will correspond to the transition points in the binary sequence 
of ones and zeros. 
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Figure 4.6-2. Pulse shaping via Nyquist’s vestigial-symmetry theorem, (a) Spec- 
trums and (b) shaped pulses [2]. 
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terference occurs, we can conceivably correct (remove) it during demod- 
ulation. We construct a suitable waveform starting from the minimum- 
bandwidth pulse described previously that satisfies Nyquist’s vestigial- 
symmetry theorem; it is defined here byt 

„(.) - S.[=^] <«'«> 


P,(o>) = T b recti — Je x P 


-jllTOi 


(4.6-7) 


This waveform is illustrated in Fig. 4.6-3. If the waveform corresponds to, 
for instance, the symbol interval from t = 0 to / — T b , it clearly has nulls 
at the sample times of all intervals, except its own at t - I„. inus it 
causes no intersymbol interference, but it cannot be realized, as noted 

P Now we construct a new waveform, p 2 (t), by adding p,(t) with itself 
delayed one symbol interval, as follows: 
p 2 (t ) = P\(t) + P\( { ~ Tb) 


„ ir(t ~ T b ) 

= Sa 

—[(£) 

'(Me 


\n(t - 27V) 


(4.6-8) 


The spectrum P 2 (oi), of p 2 {t), is 

/>(&>) = / j ,( oj ) + P t {w)exp(-ja>T b ) 

( W \ (~j™\ ^ 

= F l(w ) 2cos^-jexp^— j. 

Both p 2 (t) and |P 2 (o>)| are illustrated in Fig. 4.6-4. If p 2 (t) corresponds to 
the bit interval from , = 0 to f = T>, it contributes an amplitude TO to 
the interval’s sample taken at f = TV; it also contributes amplitude FO 
to the second interval at its sample time t = 27V, which we cal a partial 
response. No other intervals have contributions from the first interval. 
Thus the sample in any given interval contains a partial response from only 

the one preceding interval and no other. . 

Consider interval 2, and assume a polar PCM case where digits 1 or 

t All waveforms defined in the remainder of this section are based on p,(0, which is 
defined so tharit haTunit maximum value. The unit-amplitude choice will allow waveforms 
to be generated by a model like that of Fig. 3.6-2 of Chap. 3. 
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u b 0 u b u 

2 T 


(b) 

Figure 4.6-3. (a) A waveform that satisfies Nyquist's vestigial-symmetry theorem 
with minimum bandwidth, (b) The magnitude of the spectrum of the waveform 
of (a). 

0 in any interval result in transmitting either p 2 (t) or - p 2 {t), respectively. 
Only four possibilities exist for intervals 1 and 2. They are sequences 00, 
01, 10, and 11. Examination of the waveform sequences shows that the 
samples in interval 2 are either -2, 0, 0, or 2, respectively, for these four 
sequences. It follows that the decoding logic is: If the sample amplitude 
is 2, the digit is declared a 1 (the preceding digit is also a 1); if the ampli- 
tude is -2 the digit is declared a 0 (preceding digit is also a 0); if the 
sample is zero the digit is detected as a 1 if the preceding interval’s decision 
was a 0, and detected as 0 if the preceding decision was a 1. The reader 
may recognize this decoding logic as being exactly the same as for the 
duobinary waveform of Sec. 3.6 that used rectangular pulses. Thus p 2 (t) 
of (4.6-8) used in partial response signaling is a duobinary waveform. As 
with the rectangular duobinary signal, precoding can also be used with the 
duobinary waveform using p 2 (t). Decoding logic is the same as described 
in Sec. 3.6. 

The advantages of the above partial response waveform are (1) it 
transmits at data rate f b = l/T b bits/s while requiring only a bandwidth 
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(b) 


Figure 4.6-4. (a) A waveform used for partial response signaling and (b) the 
magnitude of the spectrum of p:(r). 


;/ 2 Hz,t and (2) the spectrum of p 2 (t) is more easily approximated in 

iractice' than the ideal rectangular spectrum. 

By a slight modification another useful partial response waveform can 
be designed that has only small spectral values at frequencies near dc. 
Construct p 3 «) as follows: 

p } (t) = P i(0 ~ Pitt - 


ttU - T b ) 
T h 


TT {t - 37),) 
T b 


_ In sin[{n/T h )U - 2 T b )] 
= \(n/T b ){t - 2 T h )Y - * 


♦ Bandwidth efficiency is therefore 2 bits/s/Hz. The waveform of Fig. 4.6-3(a), cor- 
responding to an ideal rectangular spectrum, also has this bandwidth efficiency but is more 
sensitive to pulse rate perturbations and its spectrum is less easily approximated. 
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The spectrum, P 3 (o>), of p 3 (t) is 

P } (a>) = Pi((o) 2 j sin[^^]exp(— . (4.6-11) 

The waveform p 3 (f) and its spectrum magnitude are shown in Fig. 4.6-5. 
If a polar PCM waveform is constructed by generating p 3 (t) or - Pi (t) in 
each digit interval having a 1 or 0, respectively, the reader can readily 
verify that the waveform is modified duobinary. It can be decoded using 
the same logic described in Sec. 3.6 for either direct or precoded data 
sequences. 

Generalized Partial Response Signaling 

Duobinary and modified duobinary systems were introduced by Lender 
[6-8] and generalized by Kretzmer [9, 10]. The generalized systems are 
extensions of the models used in duobinary and modified duobinary systems, 
as illustrated in Fig. 4.6-6 [11]. As in Sec. 3.6, the sequence of (independent) 
data digits (Os and Is) determine the coefficients a k . Coefficients w 0 , vv, , 
• . • , w Np _. | have integer values and N p - 1 denotes the number of adjacent 
intervals over which partial responses may be nonzero when P,(cd) is defined 
by (4.6-7). For example, 

w 0 = 1 (4.6-12) 

w, = 1, (4.6-13) 

and Alp = 2 in the duobinary case, whereas 

w 0 = 1 (4.6-14) 

w, = 0 (4.6-15) 

w 2 =-l, (4.6-16) 

and Alp = 3 in the modified duobinary case.t 

From Fig. 4.6-6 the impulse response of the waveform generator, 
denoted as p Np {t), is 

N p - 1 

PNpU) = 2 w tPi(‘ - iJ b) (4.6-17) 

/ = 0 

which is the typical signal generated by a signaling interval. Its spectrum, 
Pn p {.<*>), is 

N p - 1 N p - 1 

PnJM = PM 2 w,„e- jW ‘ = P,(<o) 2 *mD"' (4.6-18) 

m = 0 m — 0 

where 

D = e- J “ n . (4.6-19) 

t In duobinary and modified duobinary cases, a, = A/2 or -A/2 for binary 1 or 0, 
respectively. 
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Figure 4.6-5. (a) A waveform used for a modified form of partial response 
signaling and (b) the magnitude of the spectrum of p,l t). 


The sum multiplying PM in (4.6-18) that involves D is called the partial 
response system’s polynomial; because PM has a value only for < 

O) < co b /2, the system polynomial determines the shape of the spectrum of 
the generalized waveform but not the spectral extent. Many system poly- 
nomials have been studied and the most practical cases tabulated 111, 1ZJ. 


Example 4.6-1 _ . j- r „ m 

We determine p Np {t) and P N » when N p = 3, w„ = 1, w, - 2, and w 2 - 1. From 

(4.6-17) using (4.6-6) 




rr(t - 3 T b ) 
T„ 


i 
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Output 



w N p ~l 


Figure 4.6-6. Model system for generating a generalized partial response signal, 
which reduces to 

2n 2 Sa[7r(t - 2T„)/T b ] 
tr 2 - [(n/T b )(t - 2T b )f 

From (4.6-18) using (4.6-7) 

F,( w) = P,(oj) [1 + 2e~ J “ n + e- j2 “ n ] = P,(w)[l + 2D + D 2 ] 

= PMd + Df = P,M 4 cos 2 ||— jexp(-)(ur fc ) 

= P,(<o)4 cos 2 ^— jexp^ 

where the system’s polynomial is 1 + 2D + D 1 . A polar PCM waveform with 
a t = ± 1 using p 3 (f) will have five levels possible at sample times occurring at the 
ends of sampling intervals. These levels are 4, 2, 0, -2, and -4. 


For other details on generalized partial response signaling the reader 
is referred to the literature. Pasupathy [13] gives a good summary of the 
concepts. Others have studied optimization of the receiver [14-16] and 
provide additional references. 
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Equalization Methods 

to Control Intarsymbol Interference 

In two of the three preceding subsections we have explored ways of 
designing waveforms to have controlled intersymbol interference, which, 
by signal processing techniques, is removed in the receiver. The other 
subsection dealt with designing a waveform to have no intersymbol inter- 
ference (ISI). In this subsection we introduce another approach to the 
control of ISI. 

Suppose a pulse has been designed for transmission over a given 
channel so that the channel’s response pulse, denoted by p cb (t), has no ISI. 
An example of p ch (t) would be a pulse satisfying Nyquist’s vestigial-symmetry 
theorem. Thus samples of p ch (t), taken on either side of the maximum (at 
t = 0) at multiples of the bit interval T b , are ideally zero. Next, suppose 
that the channel has a somewhat different transfer function than that expected 
in the design. The channel’s actual response, denoted by p c (t), is different 
from p ch {t) and may contain ISI at sample times. t However, if the actual 
channel response is passed through a specially designed filter, called an 
equalizing filter, it is possible to restore the zero ISI condition at a finite 
number of sample times in the equalizing filter’s output signal. 

The structure of the equalizing filter, sometimes also called a transversal 
equalizer, is shown in Fig. 4.6-7. We assume a delay line with total delay 
2 NT b that is tapped at intervals of T b to produce 2N + 1 taps. The signals 
at the taps are weighted by constants C„ , n = 0, ± 1 , ± 2, . . . , ± N, prior 
to being summed to produce the final output, denoted by p c Jt). The response 
is 

p eq (t)= 2 C nPc [t - (N + n)T„]. (4.6-20) 

n~-N 

The signal at the tap for n = 0 is delayed by NT b , which we call the 
nominal delay of the filter. By choice of the constants C„ , it is possible 
to force p eq (t) to be zero at N sample intervals on either side of the desired 
peak output (which will occur at the nominal delay, as noted below). 

To find the constants C„ that lead to the desired behavior, let samples 
of (4.6-20) be taken at times t = (N + k)T b , k = 0, ±1, ±2, . . . , ±N. 
There will then be 2N + 1 equations (one for each sample time) and 
2 N + 1 unknown coefficients C„ . These equations can be placed in matrix 


t Channel variations can occur, for example, in telephone systems where the actual 
signal path is determined by switchgear and destination. Clearly, a long distance call will 
involve a slightly different channel than a local call. 
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form as follows 

>„( 0 ) ' 

Pe„(T b ) 

,p eg (2NT b )_ 


PM PA-T b ) 

PcUb) Pci 0) 

p c (2NT b ) p c [(2N - l)T b ] 


p c {-2NT b ) H"C-„ 
p c [(-2N + l)T b ] 

C-/V+ 1 


(4.6-21) 

In general, we seek to choose the coefficients C„ such that the equalizer 
samples are the same as though the ideal channel response, s ch {t), was being 
sampled. Thus if we allow the ideal response to have the same nominal 
delay as the equalizer, we require p eq (t) = p ch (t - NT b ) at all 2 N + 1 
sample times. In other words, 


PcWc + N)T b ] = Pch (kT b ) = 


Pchi 0), 
0. 


k = 0 

k = ±1, ±2, ..., ±N, 


and (4.6-21) becomes 


(4.6-22) 


[Peq] = [Pc)[C], 


(4.6-23) 
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where we define matrices 

fWO) 1 

P'„(T b ) j 


lP< q ] = 


p eq l(N- l)T„] 

Pe q (NT„) 

p eq [(N+l)T„) 

pMNT b ) 


C-N 

C-N+l 


(4.6-24) 


Pci 0) Pd ~ Tb) 

PciTt) Pd 0) 

p c (2NT b ) p c [i2N - 1)7U 


Pc i-2NT b ) 
p c [i~2N + 1 )T b \ 


(4.6-25) 


If the inverse of a square matrix is denoted by [•]"', the solution of 
(4.6-23) is 

fC] = [p c ]-'[p eq ]. (4-6-26) 

From the form of [pj we easily find that [C] is the middle column of 
[p c ] ~ 1 multiplied by p c *(0). The desired coefficients C„ are seen to be 
functions of samples of the unequalized channel response taken at times 
t = 0, ±T b ±2 NT b . 


Example 4.6-2 

Let the channel be ideally designed to produce a sampling pulse as shown in Fig. 
4.6-8(a). Due to channel distortion the unequalized channel response of (b) is 
assumed to have ISI at only one sample time on each side of the maximum. Thus 
only p c {-T b ) = 0.1p c/ ,(0), p c (0) = p c *(0), p c (T b ) = -O.ZpJO) are nonzero sample 
values. If we assume a three-tap equalizer ( N = 1) we have 

*o i r i.o o.i o i rc.,- 

Pcb (0) = -0.2 1.0 0.1 Co p c *(0) 

.0 J L o - 0.2 l.oj L c, 

from (4.6-23). Straightforward evaluation gives 


On using these coefficients in (4.6-20) the equalized channel responses at various 
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Figure 4.6-8. Waveforms applicable to the equalized channel of Example 4.6-2. 
sample times are 

Pcqi — T b ) ~ j Q4 Pc/|(0) 

Pcqi 0) = 0 

PcqiTb) = Pch(0 ) 

P< q i2T b ) = 0 
p 4m = PcM. 

Samples at other times are zero. The equalized channel response is illustrated in 
Fig. 4.6-8(c). 
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The transversal equalizer can force the equalized channel response 
to have nulls (zero ISI) at N sample intervals on either side of the desired 
maximum response. It does not guarantee that the equalized response will 
have nulls at sample times beyond ±NT b . In fact, if the unequalized 
channel has N u side intervals with ISI and the equalizer uses 2 N + 1 taps 
with N « N u , there will remain 2 N u sample times with ISI in the equalized 
response. N u of these points fall below t = 0 and N u fall above 2 NT b . 
Only in special cases can the equalizer completely eliminate ISI. However, 
if N is large and ISI decreases as sample times become removed from the 
maximum, ISI can be made small. 

Basically, two methods exist for setting equalizer tap coefficients. 
Preset equalizers use special pulse sequences to establish coefficients [17]. 
Adaptive equalizers use the data signal itself to continually adjust coefficients 
so that some error criterion, usually a mean-squared error, is minimized 
[18]. 


4.7 OPTIMUM DUOBINARY SYSTEM 

To illustrate the application of partial response techniques we shall discuss 
a duobinary system and one approach to its optimization. 

System 

Proper optimization requires that we define the system constraints, 
input specifications, and optimization criteria involved. The system and 
its constraints are defined with the aid of Fig. 4.7-l(a). The models of 
Chap. 3 have been used to represent the transmitter. The transmitted 
waveform s T (t), having average power S T , is the result of exciting a transmitter 
filter with transfer function H T (co) by a sequence of impulses with amplitudes 
a k . For the duobinary system a k has only values A/2 or —A/2 corresponding 
to binary data digits 1 or 0, respectively. The binary digits are assumed 
to occur with equal probabilities, P x = P 2 = 1/2 and the data stream is 
assumed to have been precoded (Sec. 3.6). The input signal to the receiver, 
having average power 5,, is the output of the channel, assumed linear with 
transfer function H ch (a>). The receiver is assumed linear with transfer function 
H r (w). The receiver’s output signal s a (t) will be constrained to have the 
form discussed in Sec. 4.6; this form is defined by the pulse shape of 
(4.6-8). This choice is to eliminate intersymbol interference. 

At the ends of symbol intervals the duobinary waveform has only 
three possible levels, which we define as A, 0, and -A. Thus the output 
signal s„(f) is generated according to Fig. 4.7-l(b), where P 2 ( o>) is given by 
(4.6-9). Due to our choice of output levels, the appropriate decision logic 
is to decide a symbol is a 1 if the receiver response r a (t) at the end of the 
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Figure 4.7-1. (a) A duobinary system and (b) the equivalent filter for transmitter, 
channel, and receiver operations applicable to the signal. 


interval is near zero, and decide a 0 if it is near either -A or A. We 
assume a sample is taken each symbol interval to enable these decisions. 

For input specification we presume the channel noise n(t) is stationary, 
zero mean and Gaussian with power spectrum yjco). We further assume 
the receiver is synchronized to the transmitter so that it knows the starting 
and ending times of symbol intervals. 

The optimization criterion is basically to minimize average probability 
of making a symbol (bit) error, denoted by P e , The minimization is done 
in two steps. First optimum thresholds are found that minimize P e . Second, 
H T (u>) and H R {w) are found that cause P e to be smallest under the constraints 
of constant (fixed) transmitted power S T and fixed output waveform shape 
[specified P 2 (w)]. Thus P 2 (a>), H ch { o>), 5^„(cu) and S T are specified in the 
minimization process. 
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Optimum Thresholds 

Since the input noise has been assumed to be Gaussian, the output 
noise is Gaussian. Denote its probability density and variance (noise power) 
by p „„(«„) and o- 2 , respectively. Then the probability densities applicable 
to the three possible output levels at the sample times are illustrated in 
Fig. 4.7-2. From Sec. 3.6, levels -A, 0, and A occur with probabilities }, 
i, and J, respectively. Since there are three amplitudes, the decision process 
generally involves two thresholds V Tl and V Tl . 


PnS'o + 4) 


PnS'o) PnS'o ~ 4) 



-A V T> 0 V Tl A 


Figure 4.7-2. Probability density functions of duobinary receiver output r„ at 
sample times for three possible signal levels A, 0, and -A. 

Now suppose level -A occurs. A decision error occurs only if 
V Tl < r 0 =£ V T2 .i An error on reception of level A also occurs when V Tl < 
r„ =s V l2 , whereas a zero signal level results in an error only when r„ *£ 
V Tl or V n < r a . Average error probability then becomes 


»„if 

• < Jv, 


PnS r a + A) (lr o 


i r 1 

PnMo ~ A) dr 0 + - I ! - PnS r o) dr a . 


(4.7-1) 


To find V Tl and V Tl that minimize P e , we differentiate (4.7-1) with respect 
to these thresholds according to Leibniz’s rule to get 

dP 1 1 1 

~= - J pJV Tl + A) - -pJV Tl - A) + - pJV Tl ) = 0 (4.7-2) 

f)P 1 1 1 

™ PnXVr i + A) + - 4 pJV Tl - A) - - pJV T2 ) = 0. (4.7-3) 

Typically p no ( v r, ~ A) in (4.7-2) and pJV Tl + A) in (4.7-3) are small in 


t If r„ > V Tl , the decision would be that level A occurred. However a decision for 
level A corresponds to a data 0 as well as a decision for level -A, so no error is made. 
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relation to other terms so that we require 

P JV Tl + A) = 2 pJV T) ) (4.7-4a) 

P«XV T2 - A) = 2 pJV T2 ). (4.7-4b) 

On substitution of the Gaussian function for p„J-), we solve (4.7-4) to obtain 
the optimum thresholds 


= J + J ln(2) = V 7 
V T , = = -V T . 


(4.7-5a) 

(4.7-5b) 


After substituting (4.7-5) into (4.7-1) and using the approximation 
erf [(V r + A)/\ r 2o- 0 ] = 1, we have 


n 3 1 _JA 2 - o-Jln(4)l 1 J. 

- ~ - - ert — —j= — - - erf 

4 4 |_ 2\/2 Aa 0 J 2 [ 

For the important case where A 2 » cr 2 


A 2 + al ln(4) 
2"\/2 A a 0 


3 / A 

‘ 7 erfc — 7= — 

4 \2V2 <r 0 , 


(4.7-6) 


(4.7-7) 


Close examination of (4.7-6) shows that P e is further minimized by 
maximizing A 2 /cr 2 . Within our system constraints this maximization is 
achieved by proper choice of transmitting and receiving filters. 


Optimum Transmit and Receive Filters 

From Sec. 3.7 the power spectrum of a(t) in Fig. 4.7-l(a), denoted 
by y„(o >), is 


?JL») = w- 

4 T b 


Average transmitted power becomes 




(<o)\H t (o>)| 2 dot = 


(4.7-8) 


\H t (o >)| 2 dw. (4.7-9) 


Our constraint that the output waveform s a (t) can have amplitudes A, 0, 
or -A at sample times and have the pulse shape of (4.6-8) is equivalent 
to requiring 

H t (cu) H ch (a>) H r ((a) = P 2 ((a). (4.7-10) 

By solving (4.7-9) for A 2 , substituting (4.7-10) to eliminate H T {a>), and using 
the output noise power relationship 


= Sf n (<o)\H R (u )\ 2 da>, 


(4.7-11) 
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(4.7-12) 


( (<o)| 2 doi 


B(o>) = 


we have 

A^ 167t 2 SjT b (4,7-12) 

Lw£^*L , ** b *** 

Maximization of A 2 /o- 2 is achieved by selecting H R ( o>) to minimize the 
denominator of (4.7-12). An easy application of Schwarz’s inequality (see 
Sec. B.9 of Appendix B) with 

A(o>) = VW \Hr(<»)\ ( 4 - 743) 

= l P;(co) J (4.7-14) 

() \H ch (a)H R (a>)\ 

gives the optimum receiver’s power transfer function 

/ \i2 = (4.7-15) 

lH * ( ' CV^W W C M\ 

Here C is an arbitrary real constant. By using (4.7-15) in (4.7-10) we have 
the optimum transmitting filter’s power transfer function 

\ H ( (4.7-16) 

|//r(co)| " \H C M\ 

The main results of (4.7-15) and (4.7-16) do not restrict the phase characteristics 
of // r (w), H ch (a), and //*(&>), which can have any values so long as they 

combine in cascade according to (4.7-10). 

For white noise where 5 ^„(qj) = -V J 2 the optimum filters become 


(4.7-14) 


(4.7-15) 


(4.7-16) 


_ V2|P,(»)I 


|H ' ( " )|! - V 5 M" 

Except for the scale constant, the transmit and receive filters have the same 

frequency characteristics in this case. ... 

In addition to white noise, if the channel is distortion-free as defined 

H ch M = G ch e~ J (4-7-19) 

where the voltage gain and delay are denoted as G ch and t d , respectively, 
then 

l/M.f - ^ <4J - 20) 


cV^o\pM 


(4.7-17) 


(4.7-18) 


and 


i#rM 2 = I PJf»% 


(4.7-21) 
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In writing these two results the arbitrary constant C has been chosen as 


C = V 2- 


(4.7-22) 


From (4.7-20) we note that the receiver’s gain is adjusted to compensate 
for any loss over the channel to keep the output levels fixed. 

Example 4.7-1 

A duobinary system uses a distortion-free channel with a power loss of 80 dB. If 
A = 2 V is required and the symbol duration is T b = 4 ms, we find the average 
transmitted power and input average received power. 

From (4.6-9) and (4.6-7) 


|/ > 2 (to)| = 2 T b cos( ™j rect^j. 

By use of (4.7-21) in (4.7-9) we have 

„ A 2 1~"* /2 / tto)\ a 2 1000 

St = ~r~ I cos — da = —— = = 318.31 W. 

4w -/- w /2 \a b ) rrT b tc 

Because G] h = 10' 8 for a power loss of 80 dB, the input average power to the 


receiver is 


5, = 318.31 (10 8 ) = 3.18 (iW. 


Error Probability 

After maximizing (4.7-12) using Schwarz’s inequality, it is found that 
the maximum value is 


2tt 2 S t T„ 

I r Vw [P 2 (a))l 

l«c»l ' 


(4.7-23) 


Error probability derives from using (4.7-23) in (4.7-6) or (4.7-7). 
For a distortion-free channel with white noise, we readily find 


A ) = Z / Gin S t T h 

CT 0 / max 4 V 


(4.7-24) 


z IgLstT 
4 V Jf 0 


(4.7-25) 


from (4.7-7). By recognizing that S T T b is the average energy per bit, we 
can write (4.7-25) as 


F,~-erfc 


(4.7-26) 
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Example 4.7-2 

Again assume the system of Example 4.7-1. For a white noise channel where 
= jo' 8 / 4 i 7 W/Hz, we find P,. From the previous example, S T T b = A/tt- 
4/rr so e = 16(10*) and P, « 0.75 erfc [(rr/4)Vl6) = 6.97(10" 6 ) after use of (F-5) 
in Appendix F. 


4.8 OPTIMUM MODIFIED DUOBINARY SYSTEM 

Suppose we are interested in constructing an optimum modified duobinary 
system using the waveform shape of (4.6-10) having the spectrum PM of 
(4.6-11). If the receiver output waveform has possible values A, 0, and 
-A at the sample times, taken at the ends of the symbol intervals, it is 
found that all the results given in Sec. 4.7 apply if PM replaces PM- 
Thus Fig. 4.7-l(a) applies to the modified duobinary system with a k = A/2 
or -A/2 when binary data digits are 1 or 0, respectively. It is necessary 
only to replace P 2 (<ii) with P 3 (o>) in Fig. 4.7-l(b). 

4.9 ftf-ARY PAM SYSTEM 

When the transmitter can issue any one of several (M) pulse amplitudes 
during a symbol interval of duration T, we refer to the system as M-ary 
pulse amplitude modulation, (M-ary PAM). In this section we examine 
M-ary PAM under the assumption that each of the M possible waveform 
levels corresponds to one of M equally probable messages from a discrete 
source. The messages are also assumed statistically independent from one 
symbol interval to the next.) Thus the probability that a given message, 
and therefore a given pulse level, is transmitted is 1/M. 

Because levels are independent and equally probable, each represents 
log 2 (M) bits of information occurring each T seconds. The rate of information 
transfer in M-ary PAM is then (l/T)log 2 (M) bits/s. If M is a power of 2, 
such as M = 2 Nb , so N b = log 2 (M), a direct comparison to binary PCM 
can be made. If N b is the number of binary bits per symbol, and if these 
N b bits occur over T seconds, the same is in M-ary PAM, the two systems 
convey information at the same rate. However, in the process the binary 
PCM system must transmit N b pulses while the M-ary system transmits 

t More general systems can be developed, but the required mathematical detail makes 
the important results less obvious. 
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only one (during time T). The bandwidth required by the M-ary system 
is, therefore, smaller than that of the binary system by a factor \/N b . 
Bandwidth saving is the main advantage of M-ary PAM; the saving is paid 
for by either decreased performance in noise, or a need for increased 
transmitter power. 


Average Transmitted/Received Power 


We shall assume that transmitted pulses are rectangular with amplitudes 
At, i = 1, 2, . . . , M, that are uniformly separated by an amount A. 
Amplitudes A, are assumed to be symmetrically displaced about zero. Hence, 
A 

A, = - (2/ - M - 1), / = 1, 2, . . . , M. (4.9-1) 

If M is odd, levels are 0, ±A, ±2A, . . . , ±A(M - l)/2. If M is even, 
levels are ±A/2, ±3A/2, ±5A/2, .... ±(M - l)A/2. 

If no loss is assumed over the channel, the average power at the 
receiver, denoted by .S', , is the same as the transmitted power and is 

S< = I A 2 P(A,) = X A? (^j (4.9-2) 

where P{A t ) = 1/M is the probability that level A, is transmitted. After 
substitution of (4.9-1) and use of known sums (Appendix D), (4.9-2) reduces 
to 


= A 2 (M 2 - 1) 

12 


(4.9-3) 


Optimum Thresholds 


At the ends of the symbol intervals, an optimum receiver will observe 
(sample) its output level and decide which of the M levels was most likely 
to have been transmitted. Now regardless of the exact form of the optimum 
receiver — as long as it is linear — the output levels, denoted by V h will be 
proportional to received pulse levels A, and will be uniformly spaced by 
an amount denoted by V. Clearly, V is proportional to A, the received 
level separation. 

Since channel noise is zero-mean, white, and Gaussian with power 
density df 0 /2, by assumption, the output noise at the sample time is Gaussian 
with mean V, when the received pulse’s amplitude is A, . Thus the probability 
densities of Fig. 4.9-1 apply to the M possible output levels V,. To establish 
which output level is actually received, we divide all possible receiver 
output levels, r ot into regions separated by thresholds V Tn as shown. The 
first step in optimizing the system is finding optimum values of V Tl . 

Let /?„„(•) denote the probability density of receiver output noise at 
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PnS'o ~ v 0 


PnS'o - y,) 



V>\ Vi\ 


y,-i I r, Ki + i 


V T, V Ti 


V t r T 

• i - 1 *i 


V T M . t V T u -\ 


Figure 4.9-1. Probability density functions applicable to the output of an Af- 
ary PAM system (at the sampler). 

the sample time of an interval. If level V, is received the probability of 
error, given V,, is 


W,)- f 

Jv 7 


PjTo - Vj ) dr 0 


(4.9-4) 


which is the leftmost shaded area in Fig. 4.9-1. Similarly, if V, is received, 
P(e|V f ) = J PnS r o ~ Vi) dr 0 + J p no {r 0 - V,) dr 0 (4.9-5) 


for i = 2, 3, . . . , M - 1. This probability is the shaded area about V, 
in Fig. 4.9-1. For level M, 

rvr\t - 1 

P(e\V M ) = I PnMo - V M )dr 0 . (4.9-6) 

— oo 


The average probability of error, denoted by P w , is then 

M , M 

f.-IWJWJ-iSW.) ,4.9.7) 

- f f P.Jr. - V,)dr. + f pjr. ~ • 

M i=1 L Jvt, 

By differentiation of (4.9-7) with respect to V Ti , using Leibniz’s rule, we 
establish a set of equations that, when set equal to zero, leads to the optimum 
thresholds V Tl . It is found that 

= W+i + Vd' . m lf 2> ^ M _ y (4.9-8) 

which fall midway between signal levels V, . These values of V Ti are illustrated 

in Fig. 4.9-1. , 

Probability of error given by (4.9-7) can now be found by substituting 
the optimum thresholds and the Gaussian form for p nc ,(')- If is fh e * n P ut 


(4.9-8) 
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noise power at a sample time we readily find 



Optimum Receiver 

Since the complementary error function, erfc(-), is a monotonically 
decreasing function of its argument, (4.9-9) is minimum when V 2 /a] is 
maximized. But V 2 /o* is maximum for a receiver that maximizes the output 
level peak power to average noise power ratio at the sample time, because 
V is the difference between adjacent signal levels (V = V i+1 - V,). Thus 
the optimum receiver is a white-noise matched filter. For the rectangular 
pulse form assumed, its transfer function can be written ast 

H R ((o) = e- JaT/2 SafyV (4.9-10) 

During a symbol interval, the received signal pulse is 

s t (t) = A, rect - — . (4.9-11) 

By computing the response of the receiver at time T to this input, the reader 
can easily show (Prob. 4-48) that 

Vi = A,, i = 1,2 M. (4.9-12) 

so 

V = A. (4.9-13) 

In a similar manner, when noise is considered we find 

2 ^0 

cd = (4.9-14) 

Error Probability 


By using (4.9-13) and (4.9-14) in (4.9-9) we have 



(4.9-15) 


Another useful form for word error probability follows the substitution of 
(4.9-3) 



(4.9-16) 


t This is an application of (B.9-7) with f 0 chosen to make the output level maximum 
occur at the sample time T. 
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A final form results from using the ratio of average energy per bit of 
information to twice the channel noise density, as given by 

e = M . (4.9-17) 

A' o log 2 (M) 

We then obtain 


MV»]- 


(4.9-18) 


Figure 4.9-2 illustrates the behavior of (4.9-18). Because the factor 
(M - 1 )/M does not vary greatly (i to 1), we find that average energy per 
information bit must increase by a factor ( M 1 - l)/3 log 2 (M) as M increases 
above 2 to maintain P w approximately constant. 


Example 4.9-1 

An eight-level PAM system transmits rectangular 50-/ts pulses with amplitudes 
A. = ± i v, ±3 V, ±5 V and ±7 V over a lossless channel for which A 0 = 
5.2(10' 6 ) W/Hz. We find average transmitted power and P w . Here A = 2 V, so 
transmitted power, from (4.9-3), is 

C 4(8 2 - 1) _ 63 
“ 3 


(same as received power because of the lossless channel assumed). From (4.9-15) 


„ 8 - 1 - 
P w = — — - erfc 

O 


'4(5.2)10- 


= ^erfc 


|| = 1.06(10- 5 ). 


Performances of M- ary PAM and binary systems can be compared 
through their word error probabilities. In the latter case P w derives from 
(4.5-3) using (4.4-5) for a polar format; it can be written ast 

log;(M) f ISJ \ 1 (4.9-19) 

w 2 |V-M 

where 5, is the received average signal power. For values of P w < 10 \ 
the complementary error functions in (4.9-19) and (4.9-16) are so sensitive 
to their arguments that their coefficients have only small affect on the 
arguments that correspond to equal P„. The two error probabilities are 
therefore approximately equal when 


l vuu UV M * 

Jfa y 


S,T h „ . . 3 S t T 

(bmary) " (M 2 - Wo 


(M - ary PAM). 


(4.9-20) 


Consider two cases. 

First, assume the two systems have equal values of M, P w , and A" 


t Since k = N b , the number of binary bits, we have k = logj(M) by assuming 
M = 2 m . 
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101og 10 (e),dB 

Figure 4.9-2. Error probability in M-ary PAM. 

and transmit the same information rate (this requires T = T b log 2 (A/)). The 
M-ary system then requires less bandwidth (by - a factor l/log 2 (M)) and 
more power (by a factor (M 2 - l)/31og 2 (M)). In the second case the 
systems have equal values of M, P„, Jf 0 , and bandwidth (this requires 
T - T b ). The M-ary system requires still more power (by a factor 
(M - l)/3) but transmits information at a higher rate (by a factor log 2 (M)) 
compared to the binary system. From these two cases we conclude, overall, 
that the main disadvantage to M-ary PAM is increased power required for 
a given performance. Its main advantage lies in its ability to use less 
bandwidth for a given information rate. 

In Chaps. 1 and 3 we indicated that some systems are capable of 
direct digital encoding of the outputs of sources with memory — that is, 
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sources with outputs that are correlated with time. In the next and the 
following three sections, several such systems are discussed in detail. 


4.10 DELTA MODULATION 

Delta modulation (DM), a European invention, was apparently first discussed 
in the English-language literature by de Jager [19]. It may be described as 
a type of PCM where only 1-bit encoding is used. One-bit coding is made 
possible by a feedback loop which is an integral part of the encoding 
process. In the ensuing development of DM principles, the simplest form 
of implementation is assumed. It has an ideal integrator in the loop feedback 
path and will be called a basic delta modulator. At the end of this section 
we consider some alternative configurations based on practical integrators. 

System Block Diagram 

Figure 4.10-1 illustrates the components of a DM system applicable 
to a single message. If time multiplexing of several messages is to be used, 
the separate pulse trains for the various messages may be added at point 

A. They are separated in the receiver by a suitable demultiplexer at point 

B. We shall assume in this section that N similar messages are multiplexed, 
each being bandlimited with maximum spectral extent W f . 

At the transmitter end of the channel, the pulse modulator accepts 
an input train of constant-amplitude clock pulses occurring at a rate o) s /2tt 
pulses per second. It converts these pulses to an output train of polar 
pulses with polarity determined by the “loop error signal — e q (t). If — e q (t) 
is positive at the time of any given input pulse, the output pulse is positive. 
If - e q (t) is negative, a negative output pulse is generated. The error - e q (t) 


Clock Pulse Train 



Figure 4.10-1. Block diagram of a delta modulation system [2]. 


is the difference /(?) - f q (t) between the input message /(t) and /„(;), which 
is the integrated version of the generated pulse train s DM (t). We assume 
that these generated pulses have a duration r which is short relative to the 
time T s — 2ir/a) s between “sample” pulses. Thus, the (ideal) integrator 
output f j(t) will be a stair-step” type of waveform with an up-going step 
for each positive pulse and a down-going step for a negative pulse. Each 
transmitted pulse is clearly a 1-bit encoded version of the signal -e q (t). 
Now since the action of the loop will be to force f q (t) to be an approximation 
to /(/), a fact made more obvious in the following paragraph, the overall 
action of the modulator is to transmit 1-bit information about the changes 
in /(/) with time. 

To better understand operation, assume f{t) is applied, as shown in 
Fig. 4.10-2(a), at t = 0. Let the input clock pulse train timing be such that 
a pulse occurs at t = 0. At t = 0, and for a short time thereafter, fit) > 
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f Q (t) and output pulses are all positive, as illustrated in (b). During this 
initial or transient period the signal f q {t) increases in steps of 8v approaching 
the message until it finally exceeds it in voltage. After the transient period, 
fq(t) will become approximately equal to /(f) and both positive and negative 
output pulses are generated with about equal regularity. The behavior of 
the error e q (t) is illustrated in (c). Except during the transient period the 
error never exceeds ±8v , where 8v is called the step size. 

At the receiver end of the channel a matched filter is shown in Fig. 
4.10-1 to provide maximum noise immunity (see Appendix B). The next 
step involves a reconstruction of the transmitted pulse train to remove 
noise. The operation is similar to reconstruction in PCM where, at the 
times of the maximums of the pulses emerging from the matched filter, 
samples of the received signal are compared to a voltage threshold level 
(0 V here) to determine which polarity of pulse was received. Based on 
these pulse-to-pulse decisions, a version s DM (f) of the original signal SdmW 
is reconstructed. In a practical receiver the reconstruction stage might be 
omitted (especially if only one message is involved). We include it here 
because it aids in both the understanding and the mathematical analysis of 
DM. If multiplexing is involved, the pulse trains are separated at point B, 
and each is applied to an integrator. For a typical message, the integrator 
output f q (t) is approximately equal to the signal f q (t) in the transmitter 
(except for noise errors in reconstruction it would be exactly equal to f q (t) 
in principle). Thus/,(f) »/,(f). Finally, a lowpass filter, having a bandwidth 
W f , is used to smooth out the sharp step-behavior of/,(f). The final output 
becomes /(f) plus errors which we may collectively consider to be noise. 

Errors and Slope Overload 

Even if receiver random noise is negligible, the reconstructed message 
f q (t) in Fig. 4.10-1 can at best equal f q (t). It can never equal the original 
message /(f) because of the 1-bit quantization involved. However, since 

f q (t) = /M + e,(D. ( 4 - 10 ’ 1 ) 

the receiver may be thought of as recovering /(f) with an error e q (t). This 
concept for handling errors was already introduced in the analysis of PCM, 
where e,(f) was called the quantization error. It and amplitude overload 
errors were the only two nonnoise type errors in PCM. In DM there are 
also two types of nonnoise errors. One, called granular error, e g (f), cor- 
responds to the error due to quantization in the 1-bit coding process. The 
second is called slope overload error, e so (t). The composite error due to 
both granular and slope overload errors is called quantization error in DM. 
Thus in terms of the component errors making up e q (t) we may write 
(4.10-1) as 

m = /(0 + e*(0 + ejt). 


(4.10-2) 
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Granular error occurs when /,(f) =/(f) + e /0— that is, during periods 

of time when there are no transient errors occurring. Slope overload errors 
occur when the input message at the delta modulator has a slope exceeding 
the slope capability of the feedback signal /,(/). From Fig. 4.10-2(a) the 
maximum generated slope capability is S v/T s . Thus to prevent slope overload 


df(t) ___ 8v 

dt max 35 T s 


(4.10-3) 


is required. If, for short periods of time, (4.10-3) is not satisfied, slope 
overload errors occur. Slope overload may be corrected for a given message, 
by increasing the step size 8v, or by decreasing T s (sample at a higher rate).' 
Both these solutions are later found to be undesirable, and slope overload 
is one of the fundamental limitations of DM. 

To better appreciate the effects of slope overload consider the sinusoidal 
message 


/(f) = A / cos(&yf). (4.10-4) 

From (4.10-3) the region of satisfactory performance corresponds to signal 
levels 


df-g -1 (V) 

8u 2n 


(4.10-5) 


The right side of this expression may be viewed as a kind of spectral 
characteristic for a basic delta modulator, since it is a function of frequency 
oi f . Taken in this light, we expect that DM would be especially applicable 
to waveforms that have a power density spectrum that decreases as the 
reciprocal of the square of frequency. For such signals the various spectral 
components could be made to match, or operate in close proximity to the 
overload level, thereby optimizing performance. Indeed, this is the case, 
and basic DM has been found to perform well with speech or monochrome 
television video messages both of which possess average power spectra 
approximating the type described. 

In order not to convey excessive granular noise A f /8v » 1 is necessary 
in (4.10-5). This condition determines the sample rate as the following 
simple example points out. 


Example 4.10-1 

For an 800-Hz sinusoidal message we find the required sample rate to prevent 
overload when A f /Sv = 20 is to be realized. From (4.10-5) 

fs = osJItt 2= 20o> f = 40tt( 800) = 100.5 kHz. 

Note that this sample rate is 62.8 times as large as the Nyquist sample rate of 1 6 
kHz. 
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Channel Bandwidth 


Assuming N similar messages are multiplexed, the time per message 
sample is TJN. Within this time slot the sample pulse duration is r =s 
TJN seconds. Since the minimum channel bandwidth W ch required to pass 
pulses of duration r is ir/r, we have 


7r ttN _ Nl o s 


Wc * - » - V = 


(4.10-6) 


In practice, one would probably design the pulse duration as large as possible 
( t /N) to convey maximum energy per sample, especially if receiver (thermal) 
noise was significant. Thus W eh would not have to be radically larger than 
the minimum value Na>J2. 


Other Configurations 

There are many variations of the basic delta modulator which uses 
an ideal integrator. In one case the integrator may be replaced by a time- 
varying network which adjusts step size with time to reduce slope overload. 
Such a modulator is called an adaptive delta modulator , and it is discussed 
in Sec. 4.13. In another variation, the message is integrated prior to modulation 
by the basic delta modulator. This variation, called delta-sigma modulation, 
is discussed in Sec. 4.12. It is a means of approximately matching the 
overload characteristics of DM to a message having a relatively flat power 
spectrum (owing to integration, the encoded message power spectrum de- 
creases with frequency and therefore approximately matches the overload 

characteristic of the delta modulator). 

Some variations of the basic delta modulator involve alteration ot the 
feedback path to include more general linear networks that do not change 
with time. Clearly, from a practical standpoint, one cannot realize a perfect 
integrator. A simple lowpass one-section RC filter is often substituted 
instead. Johnson [20] has analyzed such a DM system and found that noise 
performance is unchanged over the basic modulator which has an ideal 
integrator. There is a change in the slope overload characteristic, however. 
If the filter time constant is T c = RC and its 3-dB frequency is w c 
1 /jT c , while channel pulses have amplitude A and duration TJN (maximum 
duration), then the slope overload characteristic for a sinusoidal message 
of peak amplitude A f and frequency ay becomes 



For frequencies above ay the behavior is similar to the basic delta modulator, 
since a 1 /ay characteristic is evident. The region ay > ay again corresponds 
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to slope overload. For frequencies ay < ay the characteristic becomes flat, 
corresponding to amplitude overload, a condition not present in basic DM. 
For speech, this behavior is not serious, since the speech spectrum also 
tends to reach a maximum at lower frequencies. 

A delta modulator may also contain two integrators in the feedback 
path. In this case message slope differences are encoded as opposed to 
amplitude differences. Instabilities may result, however, for two perfect 
integrators, and some practical modifications are required. The usual practical 
circuit is a two-section RC lowpass filter having an extra (stability-controlling) 
resistor placed in series with the output capacitor [19, 21], Such modulators 
are called delta modulators with double integration. They may give improved 
noise performance compared to single-integration DM for speech. The 
article by Schindler [21] is a good summary of the various DM configurations. 
In a later paper [22] he includes a fairly extensive reference list which will 
serve those readers interested in additional depth on DM systems. Other 
references can be found in [23], and more detail on other feedback filters 
is available in [24, 25]. 

All the delta modulators discussed above use regularly spaced samples. 
It is also possible to transmit in the form of irregularly spaced samples. 
Such a system, called asynchronous delta modulation, is described by 
Hawkes and Simonpieri [26]. 


4.11 NOISE PERFORMANCE OF DELTA MODULATION 

In this section we consider mainly the noise performance of a single (ideal) 
integrator form of DM system as depicted in Fig. 4.10-1. 

Performance with Granular Noise Only 

Let us first consider receiver input (channel) noise to be negligible. 
From Fig. 4.10-1 and (4.10-2) we have 

fjt) = f q U) = f(t) + ejt) + ejt). (4.11-1) 

Here the recovered message/!/) is corrupted by granular and slope overload 
errors sjt) and e s Jt). We shall treat the errors as noises and seek the 
amount of noise power due to each which reaches the final output through 
the lowpass filter. So far as signal power is concerned, the (ideal) filter 
bandwidth W f is chosen just large enough to pass the signal undistorted. 
Output signal power is then 

S 0 =7W)- (4.11-2) 

Granular noise is essentially uncorrelated with the signal [27] and is 
approximately uniformly distributed [27, 28], over the interval 2Sv. We 
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(4.11-3) 


may gain some intuitive agreement with this fact from the behavior of the 
error in Fig. 4.10-2(c) during the nontransient time interval. The granular 
noise power prior to lowpass filtering becomes 

^0=^. (4.H-3) 

For the filtered output granular noise power N g , we adopt a heuristic 
argument given by Taub and Schilling [29]. By observing the form of e g {t) 
from Fig. 4.10-2(c) we conclude that there is considerable high-frequency 
content in its spectrum. To make an estimate of its extent, imagine e s (t) 
passed through a filter with a cutoff frequency a> c chosen to pass e g (t) with 
only reasonable distortion. The cutoff frequency is then a good estimate 
of the spectral extent of s e (t). If we select the point of “reasonable” 
distortion as that where the filter rise time n/a> c is equal to half the sample 
interval T s , the spectrum of e g (t) is found to extend to a cutoff frequency 
o) c = (o s = 2ir/T s . Now since the power spectrum of eft) is more or less 
flat [27, p. 2120] at lower frequencies, we presume the total power to be 
uniformly distributed over -o> c < w < (o c . Filtered output power now 
easily calculates to 

(4iM) 

Thus in general, the signal- to-granular noise power ratio is 

/So\ = 3^(V| (4.H-5) 

VVdm (*«)* \W f J 

To prevent slope overload it is necessary that the maximum magnitude 
of the signal’s slope, \df(t)/dt\ mm , not exceed the slope capability of the 
system, which is 8v/T s . At some signal level fit), that we call the reference 
signal level, the two slopes become equal, and we choose step size according 
to 

8v= ^ T s . (4.11-6) 

dt ma x 

W ith th is choice slope overload will not occur for any message power fit) 
$ flit). From (4.11-4) and (4.11-5) we have 

N = — ^ ^ (4.11-7) 

' 3 dt 


(4.11-4) 


(4.11-5) 


(4.11-6) 


(4.11-7) 


^ • (4.11-8) 

Wdm 4tt 2 WM/d^W f 

This equation may be put in another useful form by defining a slope crest 
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factor K cr , analogous to the amplitude crest factor of (4.5-9), byt 

f,2 = M/(0Mlmax 
zv rr -- 


(4.11-9) 


[dfit)/dtf ■ J 

and observing that the rms bandwidth of the signal power density 
spectrum 5^(<o) is 


2 [df(t)/dt? 

** rms -jf -- ; 


f co 2 y f (a>)da> 

— GO 

f S^f(oi) dco 


(4.11-10) 


(4.11-11) 


Since these two expressions must hold for any signal level, including the 
reference level, they are used in (4.11-8) to obtain 

{fr ) m (411 - ,1) 

Equation (4.11-11) gives the output signal-to-noise ratio for a single-integration 
DM system. The only system parameter appearing explicitly in the expression 
is the sample rate <u s . By use of (4.10-6) we may put it in terms of channel 
bandwidth: 

^ 6 (W' h ym 


UJ D M 55 n 2 ki Urms/ N } [ Wj) flit) ' (4,1M2) 

Thus performance in DM increases as the third power of channel bandwidth. 

Response of a DM system to a sinusoidal message has special application 
to speech transmission. An example is next used to specialize (4.11-11) 
for a single message )N = 1). 

Example 4.11-1 

We obtain an expression for iSJN g ) DM for a single sinusoidal message fit) = 
fit) = AfCos(o> f t). By using the resul t of Prob. 4-53, we obtain W ras = ai f . For 

Ki we have |rf/(f)/*|Lx = (A f a> f ) 2 and [df{t)/dtf = (A f o> f f/2, so K 2 „ = 2. Finally, 
(4.11-11) becomes 


±(*1 

8 TT 2 \Oi f 


sinusoidal modulation. 


For speech signals it is experimentally found [19] that noticeable overload 
does not occur as long as signal amplitude does not exceed the amplitude 

t Note that the dot does not imply the time derivative of K„. It is simply to denote 
a crest factor for a time-differentiated waveform. 
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of an 800- Hz sine wave set just at the slope overload value. If W f /2ir = 
3.5 kHz is assumed and the typical spectral distribution of speech is considered 
[30], 800 Hz turns out to be W rms /2tr, the rms speech bandwidth. It is also 
known [19] that good speech transmission requires a sample rate of 40 kHz, 
while very good results are achieved for a 100-kHz sample rate. From 
Example 4.11-1 these values show that we must have ( S a /N g ) DM = 1086 
or 30.4 dB for good results or 42.3 dB for very good results. 

For a DM system with double integration and operating with a sine 
wave at its reference level, ( SJN g ) DM may be put in the form [19] 


0 . 0178 ( 3 ) (Wf 
8 ir 2 \ cor 


(4.11-13) 


The ratio of (4.11-13) to the expression in Example 4.11-1 shows that the 
double-integration system signal-to-granular noise power ratio is better by 
a factor of 0.0178(o>,/W» 2 . For W f /2n = 3.5 kHz and o>J2n = 40 kHz 
(good speech reproduction) or 100 kHz (very good speech) the factor evaluates 
to 3.7 dB and 11.6 dB, respectively. These improvements make the added 
complexity worth while for speech systems. 

Performance with Slope Overload Noise Added 

In general the slope overload error e S0 (t) in (4.11-1) is correlated with 
the message. A number of analyses to find the slope overload noise power 
N s0 at the receiver lowpass filter output have been conducted for a single- 
integrator DM system. These analyses [27, 31-33] are complicated and 
beyond the scope of this book. Hence we only state and use appropriate 
results. Greenstein [33] has given apparently the most general solution. It 
is difficult to apply, however, and in the important region of heavy overload 
Greenstein’s work suggests (see his Fig. 7) that Abate s [32] empirical 
solution is reasonably accurate. Because Abate’ s results are easy to use, 
we present them here as taken from Greenstein s paper. In present notation, 
the slope overload noise power can be shown to bet 


m 2 2 

“ 3 WjJ 9 


1 + 3 Z 
Z 2 


(4.11-14) 


where Z, as given by 

Z 2 = (Sl))2 - (4.11-15) 

T] [df(t)/dt] 2 

is known as the slope overload factor. Equation (4.11-14) was derived for 


t Steele [34] gives an approximation that is a simplification of the original result of 
Greenstein [33] but still not as simple as the result used here. 
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Gaussian random or noise-like messages. The bracketed factor is recognized 
as the granular noise power from (4.11-4). 

The output signal-to-quantization noise power ratio is 


'So) = fit) 
TVdm N, + N so - 


(4.11-16) 


To put this equation in a useful form we must observe that a Gaussianly 
distributed message has no well-defined maximum amplitude or maximum 
slope. This means that granular noise power cannot be expressed in the 
exact form of (4.11-7). However, for some reference power level fit) of 
the message, let us set the step size according to 


where we define 



(4.11-17) 

(4.11-18) 


Thus at the reference signal power level, 8v/T, is proportional to the square 
root of the power in the derivative of the reference signal. The proportionality 
constant is K. It is analogous to the slope crest factor K cr for messages 
having a definite maximum slope but not equal to it.t 

Now by substituting (4.11-17) into the expressions for N g and N so , as 
given by (4.11-4) and (4.11-14), we may find ( S 0 /N q ) DM for any signal power 
level different from the reference level: 


where 


(V) x 2 K\SJN e ) DM , r 

\nJ dm 1 + is {(vJWf f (3 + x)x exp(-3/x)’ 


(4.11-19) 



(4.11-20) 

(4.11-21) 


and ITrms is the rms bandwidth of the power density spectrum of the signal 
fit). By substitution of minimum channel bandwidth we have 


(S±) -x 2 K 2 (SJNXu, 

Wdm 1 + 06/9 A 3 ) iWJW f f (3 + x)x exp ( - 3/x) ' (4 ' 1 1 ' 22) 

The quantity (SjN g ) DMr is the signal-to-granular noise ratio when the message 
is at its reference power level. The remaining factor represents the change 


t A more exact interpretation follows a comparison of (4. 1 1-17) with (4. 11-15). fc may 
be considered as a reference value of the slope overload factor. 
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in this ratio caused by the presence of slope overload noise and the fact 
that the message power may be different from the reference level. 

Figure 4.11-1 provides several plotst of (4.11-22) for N = 1. To 
illustrate the practical application of the curves we take an example. 

Example 4.11-2 

In an assumed DM system WJW f = 5 and 8 v/T, is arbitrarily selected to be four 
times the rms signal derivative. Thus K = 4 from (4.11-17). If the system transmits 
only one message, cu, = 21V,* and a ,JW f = 10. Now suppose the message power 
density spectrum is uniform over - W f < <o < W f . Then W™ - W f / 3. From 
(4.11-21) 

(&) = 3(3X10)! = 14 25 , or 11.54 dB, 

\nJ dm , 4 it 2 (4) 2 


so K 2 (SjN g ) DM r = 228.0, or 23.58 dB. Thus will be less than 23.58 dB 

by the amount shown in Fig. 4.11-1 which depends on signal levd_To see where 
the reference level falls on the W c JWj = 5 curve, let f 2 (t) = /)(/). Thus x - 
\/jC l = 1/16. The curve value is -12.05 dB at x 1 = 1/16, or - 12.04 dB. Hence 
(SJN„) dm becomes 23.58 - 12.05 = 11.53 dB. 

Note in example 4.11-2 that the operating point is not near the peak 
of the W ch /W f = 5 curve. The following example optimizes the system. 

Example 4.11-3 

Let us alter the choice of 8 v/T, in the last example to operate at the peak of the 
curve when /fy) = / 2 (f). We require x 2 to be -7.5 dB. Thus x = 0.422 or K = 
2.37. We now have 


Vl -14.25(71=) = 40.59, 


or 16.08 dB, 


and kHSJN ,) DM , = 228.0, or 23.58 dB, the same as before. Now the loss, however, 
is only 8.51 dB'. We obtain QJNJm equal to 23.58 - 8.51 = 15.07 dB, an 
improvement of 3.54 dB over the previous example. 

Effect of Receiver Noise on Performance 

It only remains to determine the effect of receiver (channel) noise on 
the performance of a DM system with a single integrator. Presence of noise 
means that the reconstruction stage in Fig. 4.10-1 will occasionally make 
the wrong decision as to which polarity pulse is received. If a positive 
pulse is received and an error is made, the output generated is a negative 
pulse. We may consider the output as actually containing the true posi- 
tive pulse plus a negative error pulse of twice the magnitude of the signal 


t Problem 4-59 develops optimum values of x in terms of V/JW, such that operation 
is at the curve peaks. 
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Figure 4.11-1. Loss in DM system signal-to-noise ratio due to message level 
variations and presence of slope overload noise. Parameter W c JV/ t also equals 


pulse. In a similar manner, a negative received pulse erroneously recon- 
structed results in a negative signal pulse plus a positive double-amplitude 
error pulse. The overall effect at the input to the receiver integrator is an 
uncorrupted stream of signal pulses plus a stream of occasional double- 
amplitude noise pulses. 

Since the pulses are short duration and the response of the integrators 
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to any given noise pulse will be a step of amplitude 28u or —2Sv, we may 
consider the noise pulses as impulses with amplitudes ±2Sv. The energy 
density spectrum for a single impulse is a constant level 4(6u)‘. Now if the 
average time between noise pulses in any one message channel is T nU then 
the power density spectrum of the noise pulse train may be taken approx- 
imately as 4(St>) 2 /7„, at the integrator input. The effect of the integrator 
is to divide this power density by o> 2 . Finally, we may integrate to obtain 
output power. 

The integral depends on the lowpass filter characteristic. Because DM 
with ideal integrators does not transmit a dc signal well (the reader should 
verify this point for himself), there is no need for the filter to have very 
low-frequency response. If V/ L is its low-frequency cutoff and W L « 
W f , the output noise power is 

N = (4.11-23) 

iy rec t 11/ v 

7„, may be related to the probability P e of any given received pulse’s being 
erroneously reconstructed by approximately P e = Nr/T n t , where r is the 
received pulse duration and N is the number of multiplexed messages. To 
determine P e we may use the same methods as were applied to PCM in 
Sec. 4.4. For received pulses of amplitudes A or -A and white input noise 
of spectral density W 0 /2 we obtain 


1 ^ / /A 2 t\ 

P ‘ = 2^\Jt} 


(4.11-24) 


where erfc(-) is as defined in (B.4-6). Thus the receiver noise power becomes 


/ /A 2 t 

8 ttWjW l Nt erfC W Jf 0 . 


(4.11-25) 


on use of (4.11-4). 

A more convenient expression for N rec results from examination of 
the input to the reconstruction stage of Fig. 4.10-1. For channel pulses of 
amplitude A and duration t, the matched filter’s transfer function is Sa(cui-/2). 
At the time of a typical sample taken in the reconstruction stage, the filter’s 
peak output signal amplitude is readily found to be A in response to the 
input pulse. Peak signal power at the sampler input, denoted by S,, is 
therefore 

S, = A 2 . (4.11-26) 

By integrating the noise power density spectrum at the sampler, which is 
given by CV 0 /2) Sa 2 (wr/2), the input average noise power is found to be 


(4.11-27) 
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Thus peak signal-to-noise ratio at the sampler input is 


and (4.11-25) becomes 

Finally, we neglect slope overload noise for convenience and write 

(So\ M 

UL ‘w MJ1 - 301 

On substitution of (4.11-29) we have 

( M (sjN)™ 

\N / ~ ■ = =r-, 4.11-31) 

1 + — — 5 erfc /-(— U 

vW,W L Nr LV 2 U/dmJ 

where ( SJN g ) DM is given by either (4.11-11) or the numerator of (4.11-19), 

depending on how step size is defined. By substituting for a> s and using 
the minimum channel bandwidth of (4.10-6), another useful form is 


(4.11-31) 



Typical evaluation of (4.11-32) (N = 1 and W f /W L = 20) shows that the 
1-dB threshold signal-to-noise ratio (SJ Nj) DMjh is not a sensitive function 
of WJW f . It varies from about 10 dB to about 14 dB for W ch /W f from 3 
to 140, respectively. 


Comparison of DM and PCM 

When receiver noise is negligible and equal channel bandwidths are 
considered, PCM is always superior to DM for large values of W ch /W f . 
For small values of W ch /Wf it is possible for DM to become superior to 
PCM. The point of changeover depends on signal conditions. The following 
example illustrates that a PCM system output signal-to-noise ratio is 3.3 
dB larger than that of a DM system when W ch /Wf = 5 and the message is 
sinusoidal. For W ch /Wf < 4 it can be found that the DM system becomes 
slightly superior. 

Example 4.11-4 

We compare output signal-to-noise ratios of DM and PCM systems for a single 
sinusoidal message when WJW, = 5. For this message K] r = 2 and K; :r = 2. 
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Assume the message frequency is oif/lv = 800 Hz and Wf/2n = 3.5 kHz, so that 
parameters approximate those of speech transmission in the DM system. Since 
W f = WJWfior DM, the equation of example 4.11-1 gives 

<or2, - 6dB) ' _ 

For PCM, assume N b = WjW f . Equation (4.5-7) with L = 2** and f\t) = 
f)(t) gives 

= 1536 (or 31.9 dB). 

Thus PCM is superior in performance to DM by 3.3 dB. Recalculation for 
W ch /W f < 4 shows DM slightly superior. 


4.12 DELTA-SIGMA MODULATION 

It was noted in Sec. 4.10 that a system using delta modulation performs 
most favorably with messages having a power density spectrum that decreases 
with increasing frequency (as 1 /w 2 ). It is not as well suited to messages 
having an approximately flat power spectrum. Furthermore, another difficulty 
arises if the message possesses a dc component, since it cannot easily be 
conveyed with DM having ideal integrators. Inose and Yasuda [35] have 
described a method whereby DM may be modified to perform well with 
messages having both a flat power spectrum and a dc component. The 
system is called delta-sigma modulation (D-SM). 

A D-SM system is obtained, in concept, by adding one integrator and 
one differentiator to a basic DM system. With reference to Fig. 4.10-1, 
the integrator is added at the input so that an integrated version of the 
message /(t) is applied to the summing point. The differentiator is provided 
in the receiver following the lowpass filter. t 

Overload Characteristic 

The D-SM system may be overloaded, just as the DM system, if the 
slope of the signal emerging from the input integrator is too large. This 
fact should be obvious, since from the summing point to the transmitter 
output we still have a basic DM system. Thus (4.10-3) applies. However, 
the maximum slope is now that of the integral of the message so that, in 


t Our description will presume these two components are actually added, since this 
approach facilitates understanding of the operation of D-SM. In practice, no additional equipment 
is needed as compared to DM. On the contrary, D-SM can actually require one fewer 
component than DM. A practical implementation is discussed in reference [35]. A practical 
design methodology for D-SM is given in [36]. 


order to prevent overload, we require that 

l/,MU ^ 8v/T s (4.12-1) 

be satisfied, where f r (t) is the reference signal level at which bv is chosen. 

For a sinusoidal message, which may be used to infer the overload 
characteristic, (4.12-1) becomes 


bv 2i r ' 


(4.12-2) 


Here A f is the message peak value, bv is the step size, as before, and 
T s = 2v/a) s is the time between transmitted pulses. For transmitted pulses 
of short duration t and amplitude A, step size is given by 

bv = At. (4.12-3) 

Equation (4. 12-2) is independent of the frequency of the sinusoidal message. 
Thus although there is an upper bound to the amplitude that should be 
employed as a function of frequency, the bound is constant with frequency. 
We conclude, then, that D-SM will perform well with messages having a 
relatively flat spectrum. 


Channel Bandwidth 


The output pulse stream in D-SM has the same form as that of basic 
DM. Thus, minimum channel bandwidth is again given by (4.10-6). 

Granular Noise Performance Limitation 


As in DM, the final output of the D-SM receiver will be the recovered 
message f(t) plus noise. The noise will again possess components of receiver 
(thermal) noise, granular noise, and overload noise. We treat only the first 
two and assume that overload noise is small. Even if receiver noise is 
small, granular noise is ever-present and ultimately limits the achievable 
output signal-to-noise ratio. 

To determine the granular noise power reaching the output it is easiest 
to retrace some of the steps leading to (4.1 1-4) for a DM system. As noted 
earlier, the noise power spectrum at the lowpass filter input is approximately 
flat over — V/ f < o> < W f , where W f is the maximum frequency extent of 
the message. By working backward through the filter with the aid of (4.11- 
4), the power density is found to be v(bv) 2 /3w s . This power density also 
applies at the filter output, since it is presumed ideal. Adding a factor w 2 
to account for the effect of the differentiator the output granular noise 
power becomes 


V, 


2tt 



Tr(bv) 2 

3co s 


co 2 dai 


(4.12-4) 
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(8 v) 2 W} 


(4.12-5) 


Signal-to-noise ratio is obtained by dividing output signal power fit) 
by (4.12-5). When we also substitute for the minimum value of Sv, from 
(4.12-1), we have 


®,Y AO 


Wmm \fM Imax \Wf) f){t) ' 

In terms of the crest factor defined by (4.5-9), we have 

(sA _ _ 9 _ 1 (tosYfU) 

\N t ) D-SM 4tr 2 W fW) ' 


(4.12-6) 


(4.12-7) 


For a sinusoidal message K] r = 2 and/ 2 (t) = 7IW, this expression reduces 

V, (4.12-8) 

Wmm 

which is identical to that obtained by Johnson [20]. By comparing 
(4.12-7) with its delta modulation counterpart (4.11-11), we find that both 
depend on the cube of the sampling rate. 

Johnson has also studied the case of practical integrators. That is, 
the integrators are simple, single-stage, RC lowpass filters hav ing a 3-dB 
bandwidth o) c . For a sinusoidal message, his result for pit) = f ft) is 


(4.12-9) 


nJ d . sm ^ 2 \W f ) [1 + 3K/W,) 2 ]' 


Finally, we note that Inose and Yasuda [35] have studied both single 
and double practical RC integrator forms of D-SM. 


Example 4.12-1 

DM and D-SM systems both operate with sinusoidal messages of frequency cu, = 
W f . We compare their performances. From (4.12-8) and the result given in Example 
4.11-1, we have (with W f = w f ) 

{S°L N ° )d M = 3 (or 4.77 dB). 
iS„/N„) dm 

In this particular case D-SM performs better. 


Performance with Receiver Noiae Added 

Receiver noise will occasionally cause the reconstruction stage in Fig. 
4.10-1 to produce a pulse of incorrect polarity. The probability of such an 
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incorrect decision is denoted as P e . Just as in DM, the output of the 
reconstruction stage may be viewed as a stream of correctly recovered 
message pulses plus a stream of error or noise pulses having magnitude 
twice that of the message stream. By applying the previous discussion of 
the error pulse stream for DM, we may justify that the noise power density 
spectrum at the integrator input is 4 i8v) 2 PJNT. Here N is the number of 
multiplexed messages and r is the duration of transmitted pulses. 

Since the integrator cancels the effect of the differentiator (and both 
can actually be eliminated in practice), the output noise may easily be found 
by integration: 



1 rUiSvfP,. 
to J. W/ ~Nr d( ° 


(4.12-10) 


Upon recognizing that P e is again given by (4.11-24), N rec evaluates to 


2(8vf W f 


erfc(\MV‘Y'o)- 


(4.12-11) 


Because the D-SM receiver uses a matched filter as in DM, A 2 t/W 0 is given 
by (4.11-28): 


/JA = 2 A 2 t 
W D.SM -*o 


(4.12-12) 


On using (4.12-5) in (4.12-11) output signal-to-noise ratio becomes 



Here iS 0 /N g ) D . SM is given by (4.12-7). 


(4.12-13) 


4.13 ADAPTIVE DELTA MODULATION 

In previous discussions we found that a delta modulation system became 
overloaded when the instantaneous (short-term) slope of the message was 
too large. Specifically, if the signal slope exceeded the ratio of step size 
Sv to the time T, between samples, overload occurred. For a given sample 
rate <o s = 2tr/T s , we could reduce slope overload by increasing step size 
or decreasing the message input power level (and thereby reducing signal 
slope). Both measures are undesirable, because they give decreased per- 
formance. Another disadvantage in DM is its sensitivity to signal level 
variations. Even if slope overload is absent, the signal-to-granular noise 
power ratio decreases when signal power level decreases [see (4.11.5)]. 
Thus in DM there exists only a finite range of signal-to-noise ratios, which 


218 Baseband Digital Systems Chap. 4 

exceeds whatever minimum one might establish as acceptable performance 
(for example, a good telephone link requires a signal-to-noise ratio of about 
30 dB minimum). This finite interval is called the dynamic range. 

Adaptive delta modulation (ADM) is a variation of DM which offers 
relief from the above disadvantages by adapting the step size to accommodate 
changing signal conditions. If the input signal’s slope is large, step size is 
caused to increase, thereby reducing slope overload effects. On the other 
hand, step size is decreased if the message is changing slowly or decreases 
in power level, thereby reducing granular noise. This latter fact leads to 
one of the principal advantages of ADM, increased dynamic range. Clearly, 
from (4. 1 1-5), if ( 8vf decreases in proportion to decreases in message power 
level f 1 ^), we may cause signal-to-noise ratio to become constant, or nearly 
so, over some range of signal power values. In practice, 40 dB to 50 dB 
of such dynamic range extensions may readily be achieved. 
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would be similar to that of Fig. 4.10-1 but with all the components to the 
right of point B replaced by the receiver of Fig. 4.13-l(b). 

In the ADM system of the preceding paragraph, step size is varied 
to match signal conditions. Alternatively, step size can be kept constant 
by using a basic delta modulator and signal conditions at its input can be 
adjusted [42], When signal level is large, a variable-gain circuit decreases 
the effective signal level at the delta modulator’s input. Small signal levels 
result in large gains. The effect is a type of signal compression in the 
modulator. The inverse (expansion) operation is required in the receiver 
so the system uses a form of companding. The modulator’s gain control 
signal is generated in a (second) feedback path around the modulator. 
Performance of this type of ADM is about equal to other ADM systems, 
but it has the disadvantage that the system requires a divider (for expansion), 
a device which is not always convenient in practice. 

It is no doubt obvious that the central problem in ADM is the design 


System Block Diagram 

Step size in ADM can be changed by placing a variable-gain circuit 
in the feedback path of the basic DM generator (Fig. 4.10-1). The circuit 
may be made to give discrete [32, 37] or continuous [38-40] gain variations. 
Regardless of the type of gain variation, the main problem in ADM is the 
manner in which gain is controlled. At least two basic methods are possible. 
In the first, called forward control, a control voltage for step size adjustment 
is derived from the input message. In order for the receiver to have step 
size knowledge, this voltage must be transmitted over the channel separately 
from the ADM output pulse stream. A typical system [41] encodes the 
voltage in the form of a second DM pulse stream (at a low sample rate) 
and multiplexes the two streams in time. Disadvantages of the method are 
the need for 2-bit streams and the problems associated with separating the 
two streams at the receiver. 

The second method of controlling gain, called feedback control, does 
not have the above disadvantages. The applicable block diagram is illustrated 
in Fig. 4.13-l(a). It is the same as in Fig. 4.10-1 for DM (up to point A) 
except for the addition of the variable-gain circuit and the step-size controller. 
The purpose of the controller is to sense the slope condition of the message 
conveyed in the transmitted pulse train s ADM (f). If the slope is large, the 
controller output causes the variable-gain circuit to have a large gain. Loop 
step size is then large. If the slope is small, the controller output causes 
a small gain and thereby a small step size. Because the loop will force 
f q {t) to approximately equal fit), all that is necessary in the receiver for 
message recovery is to reproduce the operations in the feedback path of 
the encoder (modulator). Thus, the receiver blocks shown in (b) are identical 
to their modulator counterparts. The receiver for an overall ADM system 


Clock Pulses 
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Figure 4.13-1. (a) Adaptive delta modulator using feedback control and (b) the 
corresponding receiver decoder [2], 
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of the step-size controller. There are two important considerations. One 
is the functional law relating gain changes to signal slope changes. The 
other is the form of the implementation. In the following two subsections 
we discuss implementations. The natural course of discussion will involve 
the gain law as well. However, choice of a gain law is not inherent to any 
given implementation. The two can be separately chosen as far as principle 
goes. Some comments regarding an optimum gain law are included in the 
later consideration of noise performance of ADM. 

Instantaneous Step-Size Control 

When the variable-gain circuit is discrete, the usual method of im- 
plementing the step-size controller involves digital logic. Gain changes 
occur on a pulse-to-pulse basis and are said to be instantaneous. Several 
variations of design are possible [22, 32, 37, 43, 44]. In one approach [37] 
the controller has a 1-bit memory. By considering the fact that slope 
overload leads to a sequence of output pulses of identical polarity , it compares 
the polarity of each transmitted pulse with the immediately preceding one. 
If they have the same polarity the controller indexes the gain circuit to a 
larger gain value (typically by a factor P larger than the previous gain). If 
the polarities are not the same, a sign reversal has taken place and gain is 
indexed to a lower value (typically by a factor Q lower). This has sometimes 
been called constant factor delta modulation (CFDM). Jayant [37, 44] finds, 
from computer simulations with speech messages, that PQ = 1 and P ~ 
1.5 represent optimum conditions to minimize quantization error power. 
Later simulations of Zetterberg and Uddenfeldt [43], however, indicate that 
PQ = 1.05 and P = 1.1 are more optimum, giving both minimum quantization 
error and maximum dynamic range. The difference in the two results ap- 
parently is due to the difference in the messages assumed in the simulations 
[43]. On the other hand, from reference [43], signal-to-noise ratio is not a 
sensitive function of P when optimum Q is used. It changes by about 3 
dB as P changest from 1.1 to 1.5. Optimum PQ remains in the vicinity of 
1 . 1 . 

The above technique can be extended to controllers having more than 
a 1-bit memory [22, 32, 43]. Data of reference [43] indicate that, for up to 
3 bits of memory, final receiver output signal-to-noise ratio does not change 
greatly (less than 1.5 dB for 1.1 < P < 1.5 and the optimum Q in each 
case of P when o>JW f is near 13.7). The decision logic chosen was to 
increase gain by the factor P if, for k bits of memory, all k + 1 pulses 
were of the same sign and to decrease gain by the factor Q otherwise. 

Schindler [22] has shown that, even with no slope overload, there 
may be cases where two consecutive pulses transmitted may have the same 


polarity. On this basis he justifies that 2 (or more) bits of memory are 
better suited to control step size for nonstationary messages such as speech 
waveforms. A decision logic based on four total bits (3-bit memory) is 
given. It is most easily understood by assigning a 4-bit binary word to the 
four most recent pulse transmissions. If the most recent pulse corresponds 
to the least significant digit in the word, then gain is increased by 2 dB if 
the digital number is 0 or 15. Gain is increased by 1 dB if the number is 
4, 7, 8, or 12. Gain is decreased by 0.125 dB for all other numbers. In a 
practical system, the decreases are accumulated until a total decrease of 
1 dB is called for; the gain then is decreased by 1 dB. 

Syllabic Step-Size Control 

Many adaptive delta modulators do not change step size on a pulse- 
to-pulse basis. Rather, changes are made much more slowly. Since most 
research has been directed toward use of DM and ADM for speech encoding, 
such slow control is referred to as syllabic. The usual implementation 
involves a continuously-variable gain circuit controlled by an analog voltage 
from the step-size controller. This is often called continuously variable 
slope delta modulation (CVSD). The input to the controller is the ADM 
output pulse stream. This input may be processed either digitally [21, 39, 
45, 46] or with analog circuitry [29, 38-41, 47] to obtain the analog control 
voltage. 

The system of Greefkes and Riemens [46] is a good example of digital 
processing. Whenever the system output pulse stream contains four con- 
secutive pulses of either state (on or off unipolar rather than polar pulses 
were used in reference [46]), a dc level is initiated. The level is maintained 
for a time equal to the total number of consecutive sampling intervals that 
the pulse state is preserved. The result is a sequence of "pulses” which 
are averaged by passing through a lowpass filter (bandwidth of 100/V Hz) 
to obtain the slowly-varying control voltage. Clearly, the control voltage 
is proportional to the average fraction of time that the system output pulse 
stream contains four or more consecutive pulses of the same state. However, 
we may show that the voltage is also proportional to the fraction of time 
that the slope (derivative) of the message exceeds half the average slope 
capability of the ADM system. The proof consists in recognizing that, for 
four pulses, half the average slope capability is exceeded when the average 
number of pulses of any one state exceeds two. Only the case where all 
four pulses have the same state satisfies this condition. Greefkes and 
Riemens define the ratio of the signal derivative to the maximum permissible 
value of this derivative as modulation index. Thus the analog voltage at 
the output of the step-size controller is proportional to the average fraction 
of time that the modulation index exceeds j. 

A good example of an analog processor is the system of Tomozawa 


t These conclusions assume 7 s otj W, s 13.7. 
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and Kaneko [38]. First, the system output pulse stream is passed through 
an integrator to obtain a signal representing the modulated signal component 
contained in the coded pulses. The integrator output is then envelope- 
detected. The detector feeds a lowpass filter which produces a slowly 
varying output voltage proportional to the modulating signal average level. 
The lowpass filter has a time constant of about 5 ms or a bandwidth (3 dB) 
of 100/ir Hz. Thus the controller output is a voltage proportional to the 
average message voltage magnitude over a 5-ms time interval. The voltage, 
plus a small fixed bias, controls the gain of the variable gain circuit. The 
bias helps prevent instabilities [38] and prevents the variable gain from 
becoming zero when the detector output is zero. 

It should be observed that the analog control voltages in the above 
two example systems are different. One is related to message slope, the 
other is related to message magnitude. In the following subsection, we 
discuss noise performance assuming syllabic step-size control, and show, 
at least for stationary random signals, that either approach leads to the 
same performance. 

Quantization Noicc Performance of ADM 

The performance of a DM system may easily be extended to give the 
performance of ADM. We model the message as a zero-mean stationary 
Gaussian random signal in order to make use of previous work. The model 
will form a reasonable first approximation for other messages. The assumption 
of stationarity allows a fixed relationship to exist between signal power and 
the power in the signal derivative /(f) = df(t)/dt. The relationship is 

fit) = W 2 m> fU), (4.13-1) 

where is the rms bandwidth of the power density spectrum of fit), as 
given by (4.11-10). Thus a certain average slope implies a certain signal 
power, and vice versa. 

We treat ADM as DM with a variable step size 8v that is a function 
of message slope, or the equivalent, message level. Receiver output signal- 
to-quantization noise power ratio must again be given by either (4.11-19) 
or (4.11-22) if suitable account is made for a variable step size. Other 
approaches, although less direct, are also possible [48]. Suppose we begin 
by placing an upper bound or “reference” value 8v r on 8v by choosing the 
ratio of 8v r to sampling interval T, to be proportional to the rms slope of 
the message when it is at some reference level f r )t). In other words, 

8v r = (4.13-2) 

where it is a constant of proportionality. Equation (4.13-2) is just an extension 
of (4.11-17) for fixed 8v in delta modulation. 

Now the signal-to-quantization noise power ratio that we seek is given 


by 


N, 


q/ ADM 


m 

N g + N„ 


(4.13-3) 

where the granular noise power N g is given by (4.11-4) for any 8v, and the 
slope overload noise power N so is given by (4.11-14) for any 8v and signal 
power. In order to find N so , which is a function of Z as defined in (4.11- 
15), we first write Z 2 as 


Z 2 = 


(6u) 2 

T]f\t) 


8v 

8v r 


K 2 fit) 

fit) 


8v 

8v, 


2 K 2 f 2 (t) 

fit) ' 


(4.13-4) 


In obtaining this expression, T, from (4.13-2) has been substituted and (4.13- 
1) was used. In terms of x, previously defined as 


x 2 = 


fU)_ 

Kfrit) 


(4.13-5) 


we have 


Z 2 = 


8v\ 1 


8v r ) x 


(4.13-6) 


Substitution into (4. 1 1-14) gives the desired expr ession for N so . Performing 
somewhat similar operations on the ratio fit)/N e will allow us to write 
(4.13-3) as 



Here 


(4.13-7) 


/M = 3 fit) (o s = _j_ /_w^\ 2 uy 

Wadm, iSv r ) 2 W f 4tt 2 K 2 VWhJ \w f ) 


(4.13-8) 


is the reference signal-to-granular noise power ratio. In terms of minimum 
channel bandwidth NwJ 2, which is the same as in DM, we write 




(4.13-9) 
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We pause to examine our results. Observe that (4.13-7) is identical 
to (4.11-19) in form. In fact, x in the DM expression is replaced here by 
x(8v r /8v). Now since 8v r is the maximum value of 8v, and it occurs whenever 

fit) s* frit), then, for this condition, x 2 {8v r /8v) 2 here is identical to x 2 for 
DM Thus, whenever x 2 3= 1 /K 2 we expect the normalized signal-to-noise 
ratio of ADM to be the same as in DM. When x 2 < 1 /K 2 the two are 
different. In DM, (SjN q ) DM decreases as x 2 decreases. In ADM, (SJN q ) A dm 
may be made nearly constant as x 2 decreases if x 2 (8v r /8v) 2 is nearly constant. 

The condition required for fi8v r /8v) 2 to be constant is readily found. 
When 8v = 8v r , x 2 must equal l/K 2 . For 8v < 8v r we require 



or, on substitution of (4.13-5), 

8v = 8v r 



(4.13-11) 


In words, 8v should be proportional to [/ 2 (t)]' /2 , a condition which can be 
approximated in practice. 

Unfortunately a normalized plot of (4.13-9) is not possible, as was 
done for its DM counterpart (4.11-22), because 8v r /8v cannot readily be 
written in terms of x. We may, however, work toward developing some 
representative curves for a speech message to see the typical behavior of 
(4.13-9). First we assume the following approximation for (4.13-11): 


8v 

8v r 




fit) < fit) 
fit) ^fJifl- 


(4.13-12) 


Here 8v 0 is the smallest value that 8u is allowed to have. The ratio 8v r /8v 0 
when expressed in decibels may be called the dynamic range factor F*. 


That is, 


F* = 20 log, 0 



(4.13-13) 


Fjy is the maximum amount that system dynamic range is increased over 
that of a basic DM system. 

Next, we model the speech signal as a Gaussian random message 
having a simple lowpass power density spectrum. Below the 3-dB frequency 
(O = w 3 it is constant. Above W 3 it decreases at - 6 dB per octave frequency 
increase out to a point W f where it abruptly becomes zero for 
to > W f . For this model it is readily shown that 


(w^Y | 

r iw 3 /Wf) 

(wf 2 ' 

ii 

tan -1 iWf/Wf) 

\Wf) . 


For speech signals W r /W 3 is approximately 4.35, corresponding to W rms = 
0.344 W)- [30], 

Figure 4.13-2 has been constructed from (4.13-9) for a single message 
using (4.13-12), W rms = 0.344W / , W ch = 10W,, and K = 2.985, which is 
the value that corresponds to the maximum point in Fig. 4.11-1. The effects 
of dynamic range factors of 10, 20, 30, and 40 dB are shown. For comparison, 
the response of a basic delta modulation system is shown dashed. 


Example 4.13-1 

For the system of Fig. 4.13-2 we calculate the increase in dynamic range of the 
ADM system relative to the DM system when the minimum required signal-to-noise 
ratio is 15 dB. F^ = 40 dB is assumed. 

In the slope overload region, (SjN q ) of 15 dB occurs for both systems at a 
relative signal strength of +6.8 dB. For smaller signal levels the DM and ADM 
systems reach 15 dB at relative signal strengths of -12.5 dB and -50.5 dB, 
respectively. Thus the dynamic range of DM is 6.8 + 12.5 = 19.3 dB. That of 
ADM is 6.8 + 50.5 = 57.3 dB. Dynamic range extension becomes 57.3 - 
19.3 = 38.0 dB, or slightly below the maximum of 40 dB. 


Hybrid Configurations 

The principal configurations of ADM use either instantaneous or syllabic 
step-size control. It is possible, however, to combine both types of control 
to form what is called hybrid companding delta modulation (HCDM). The 
basic technique of Un and Magill [49] as modified by Un and Lee [50] has 



t=++ in dB 
fft) 


Figure 4.13-2. Receiver output signal-to-noise ratio in adaptive data modulation. 
A speech message is assumed for which = 0.344W>[2]. 
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been compared to other ADM systems [51]; performance is claimed to be 
slightly superior (2 to 3 dB) to CVSD or CFDM systems. 

The hybrid scheme can be modified to use more than one sample rate 
[52]. Buffers are used so that the channel pulse rate is maintained constant. 
Compared to fixed-rate HCDM, signal- to-quantization noise ratio can be 
improved by 3 dB to 4 dB [52]. 


4.14 DIFFERENTIAL PCM 

From our earlier study of a delta modulator, we found that each transmitted 
pulse corresponded to a 1-bit encoding of the difference between the message 
f(t) and an approximation f q (t) to it. The overall effect of the pulse modulator 
(Fig. 4.10-1) was to quantize the difference signal into two levels, sample 
the quantized difference, and generate a 1-bit coded pulse stream having 
one pulse per sample. If, instead, the difference signal is quantized into 
multiple levels, sampled, and each sample coded into an AZ^-bit binary code, 
the result is called differential pulse code modulation (DPCM). The code 
is transmitted over the channel as binary pulses with N b pulses per sample. 
DPCM is sometimes known as delta PCM. 

DPCM System Block Diagram 

DPCM systems may take on many forms. We shall discuss only the 
most elementary version and reserve comments on the variations for the 
end of this section. A simple DPCM modulator is illustrated in Fig. 4. 14- 
Ha). The demodulator is shown in (b). When these components replace 
their counterparts in Fig. 4.10-1, a DPCM system is formed. Consider first 
the modulator. The error - e,(f) is rounded off to the nearest of 2 * equally 
spaced levels where N b is a positive integer. The quantized version of 
-e (t) acts as a modulating signal to the pulse amplitude modulator. By 
assuming the clock pulses have a short duration and constant amplitude, 
the modulator output is a train of amplitude-modulated pulses. Each pulse 
becomes a quantized sample of the error -e,(f). Samples occur at the 
sample rate w s (every T, = 2t r/co s seconds). The PCM coder converts each 
sample pulse into an N„-bit binary pulse sequence which is transmitted 
over the channel. The sample pulses are also applied to the integrator 
which closes the loop by producing /,(/) at its output. 

Since the sample pulses are narrow, the integrator generates a staircase 
waveform as in a basic delta modulation system except there are now 
2 Ni possible step sizes available (including sign). The steps are [53] 

±8u 0 » ± 38<>o, -., ± a Nb ~ 3)Suo, ± (2 N ‘ - ( 4 - 14 ;D 

where Su 0 is the magnitude of the smallest step. Steps and pulse polarities 
are related through a simple binary code. Figure 4.14-2 sketches some 
typical waveforms of interest in generating a DPCM signal. A 2-bit code 



J DPCM (*) 

(b) 

Figure 4.14-1. Block diagrams applicable to a differential PCM system, (a) Mod- 
ulator and (b) demodulator. The components fit into Fig. 4.10-1 to the left of 
point A and to the right of point B, respectively, to form a DPCM system [2]. 

is assumed; that is, N b = 2. Quantization error magnitude |e,(f)| may be 
as large as 38u 0 > as shown in (c). The pulse stream of (b) assumes that 
steps of 38o 0 , 8d 0 , -Sv 0 , and -38u 0 are coded with pulses having polarities 

+ + , + - , - + , and , respectively. 

More generally, quantization error magnitude can be as large as 
(2** - l)8o 0 , and f q {t) may still remain a good approximation to /(f). Note, 
however, that periods of large error correspond to rapid changes occurring 
in the message. Thus if the signal’s slope exceeds the maximum slope 
capability of the system, we may have slope overload just as in DM. To 
prevent slope overload 

l/WLax = M/MMImax « ~ ~ (4.14-2) 

■* s 

is required if the message has a maximum slope. If not, we may select a 
constant K such that 



These two conditions are analogous to (4.10-3) and (4.11-17) for DM. 
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66u 0 


Figure 4.14-2. Waveforms applicable to DPCM. (a) /(/) and/,(t). (b) DPCM 
pulse train, and (c) the error signal [2). 

At the receiver end of the system, a PCM decoder recovers^ the 
amplitude-modulated sample pulses which, when integrated, become f q (t), 
which is approximately equal to f q (t) back at the modulator. Except for 
receiver noise errors it would be the same. The lowpass filter smooths out 
the sharp steps in /„(/), and the response is /(f) pulse noise. As in DM 
the noise has granular, slope overload, and receiver noise components. 


Channel Bandwidth 

The channel bandwidth required in DPCM is that of an N b -bit PCM 
system. Assuming N similar messages are time-multiplexed, we have 

w > 1 > = N ? JVi (4.14-4) 

ch " T " r s 2 

Here the PCM pulses have duration t. 
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Performance with Granular Noise Only 


(4.14-5) 


(4.14-6) 


^"-1 ai analyses may dc cueu wmcn aea] witn tne vanous noise 
components in the final receiver output signal [27, 28, 31, 53], If only 
granular noise is present, which corresponds to times when the message 
is slowly varying such that the system easily follows the changes, van de 
Weg [53] has determined the output signal-to-granular noise power ratio. 
By using a random message model he obtains 

(^)dpcm = 2 k fe)’ Nb * 2 ’ (4 ' 14 ' 5) 

where W f is the maximum spectral extent of the signal and 

k = (8u 0 )7/ 2 M. (4.14-6) 

Equation (4.14-5) is valid as a first approximation independent of the power 
spectrum of the message [53, p. 383], and van de Weg even justifies its 
application to a nonrandom sinusoidal signal [53, p. 384], Thus we may 
apply (4.14-5) in a broad sense. 

For signals having a definite maximum magnitude it is convenient to 
set the maximum step size (2^* — l)8i>o to prevent slope overload according 
to the maximum or reference slope value /,(/) of /(f): 

l/Wlmax = — ~ 2 1)Sl) " ] • ( 4 . 14 - 7 ) 

*■ s 

For messages having no well-defined maximum magnitude, such as Gaussian 
random signals, we may choose the maximum step size such that system 
slope capability is larger than mean-squared signal slope by some factor 
K 2 > 1: 


_ K2" 4 


k 2 m = 


(4.14-8) 


For this latter case k is easily evaluated. Substitution into (4.14-5) gives 

m _ 3(2*» - D 2 ( Wf y /vv m 

J^g/ DPCM 


3(2^ - l) 2 
Sn 2 K 2 


N b ^2. 


(4.14-9) 


For the former case, we only replace K 2 by k 2 cr , the slope crest factor 
defined in (4.11-9). W rms is defined in (4.11-10). 

We note that (SJN g ) DPCM increases as co/ which was also found for 
DM, D-SM, and ADM systems. If N b = 1 we might expect that (4.14-9) 
would give results near those for DM. However, the value of (4.14-9) is 
actually 3 dB smaller than obtained in DM. This occurs because the derivation 
of (4.14-9) neglects correlation between successive steps when N b = 1. 
For N b 3= 2 the correlation becomes small [53]. Other important observations 
are that performance depends on the signal’s power spectrum through the 


228 


Baseband Digital Systems Chap. 4 


term (W f / W m f. The only system parameters! appearing explicitly in (4. 14- 
9) are cu, and N b . Increasing either leads to better performance. 

We now give three examples to illustrate use of (4.14-9) for three 
different information signals. 


Example 4.14-1 

Let us assume a random noiselike information signal having a uniform (constant) 


power density over — Wf < a> < Wf and a slope crest fa ctor K„ — K = 4. On 
using (4.11-10) we find wL = W}/ 3. From (4.14-9) with/ 5 )/) = f](t) 


SA _ 9(2 Nt - l) 2 / a>'Y 
.^k/dpcm 128tt \W// 


N b » 2 . 


If we now let N b = 2 and a ) S /W f — 10, we have (SJN t ) dpcm - 64.1, or 18.1 dB. 


Example 4.14-2 

Suppose/)/) is a sinusoid of frequency <o f s W f . From (4.1 1-9) we easil y find that 
k] r = 2. From Prob. 4-53 W^ = a> f . From (4.14-9) with/ 2 )/) = f)(t) 


sa 3(2 w> - 1) ; 

Ng ) dpcm 16rr \ &, // 


N b » 2. 


Example 4.14-3 

As we have previously stated, results for delta modulation of speech signals may 
be found from the sinusoidal signal equation if oi//2tt = 800 Hz and W f / 2tt = 3.5 
kHz. By using these values in the equation of Example 4.14-2 we obtain 


AA _ 3.59(2 W ‘ - l) 2 UV 
Ng) dpcm n W 


N b »2. 


Again letting N b — 2 and m,/ Wf = 10, we calculate (S 0 /N,)dpcm — 3274, or 35.2 
dB. By comparing with Example 4.14-1 we see that considerable difference in 
results can occur for different signal characteristics. 


Performance Comparison with DM and PCM 

DPCM is readily compared with DM. On the basis of equal channel 
bandwidth, the same values of K, and equal message characteristics, we 
obtain 

(Zl) , (4.14-10) 

\N s /dpcm \N g / DM 

which shows that DPCM is superior to DM if N b 2= 4. 

Van de Weg [53] has compared DPCM performance with that of PCM 
for a random message having a flat power spectrum and a slope crest factor 
of four. By assuming equal values of u> s and the same number of bits per 
sample interval, it is found that DPCM performance is equal to or better 


t k is a parameter related to both the system and the message. 
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than that of PCM when <o s /W f 4 and N b & 4. On the other hand, for 
speech signals DPCM is superior to PCM for all values of oj s /W f and N b 
when the comparison again assumes equal values of wJW f and N b . 

Performance with Slope Overload Noise Added 


When receiver noise is negligible the total quantization noise power 
at the receiver output is the sum of granular and slope overload noise 
powers. O’Neal and Rice [31] have determined slope overload noise for a 
delta modulation system (N b = 1). According to O’Neal [31], slope overload 
noise is determined from the delta modulation result by replacing the DM 
system step size by the maximum value ( 2 Nb - l)8i> 0 in the multilevel 
quantization (DPCM) case. Thus if we apply the same procedure to Abate’s 
slope overload noise power which is given by (4.1 1-14), algebraic manipulation 
assuming random messages reveals 


x k 2 (SJ N glnpcM , 

(2*-i)» /».y 

9 \Wf) K 


Here K is determined by (4.14-8), x is given by 



(4.14-12) 


f\t) i s the power in the message, which may be different from the reference 
level /,(/), and (SJN H ) DKM r is given by 


J^gl DPCM.r 


3(2* - l) 2 / W f V /oj,y 
8 t t 2 K 2 U r J [wj 


N b ^2. 


(4.14-13) 


A plot of (4.14-11) for a fixed value of N b shows a behavior versus x similar 
to that for DM given by (4.11-19). 

Although we shall omit a discussion of receiver (channel) noise effects, 
it is known [54] that performance for a well-designed DPCM system is 
considerably better than that of a well-designed PCM system operating on 
the same digital channel, even if the channel is noisy. 


Other System Configurations 

As with most basic modulation techniques, additional complexity may 
lead to better performance. Such is the case with DPCM. Good discussions 
of the more advanced DPCM system configurations which result are given 
by Jayant [44] and Bayless et al. [55], Our comments shall refer to data 
in reference [44], For original sources and a good list of other references 
the reader is referred to Jayant. 

We found that, by using a more complicated linear feedback network 
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in a DM system, performance could be improved. The network could be 
either stationary (double versus single integration) or time-varying (adaptive 
DM). The analogous configurations also exist in DPCM. If the basic DPCM 
integrator is viewed as a prediction circuit, a more general predictor can 
be implemented that is either stationary or adaptive. Typical stationary or 
fixed predictors involve feedback networks which are specified by n constant 
parameters. The optimum values of these parameters are related to the 
correlation (and therefore spectral) properties of the message. The case 
n = 1 corresponds to the basic DPCM system. In practice n « 3 would 
be implemented because little performance improvement follows n > 3. 
For example, with n = 3 and speech messages, DPCM system performance 
(receiver output signal-to-noise ratio) is about 8.4 dB better than in a PCM 
system [44]. Only a 0.2-dB additional gain is obtained when n is increased 
to 5. 

If the correlation properties of the message change with time, such 
as with speech waveforms, the optimum predictor must change with time. 
Thus the n parameters of the predictor may be made adaptive by performing 
periodic (typically on the order of every 4 ms for speech) analysis of short 
segments of the message to derive new optimum values. Such an adaptive 
DPCM system for n = 3 and speech signals is capable of about 1.6 dB 
improvement over a basic DPCM system with n = 3. For n — 5 and 10 
the gains increase to 2.9 dB and 4.0 dB, respectively. 

Finally, we note that configurations are possible which alter the quantizer 
characteristics. Because a DPCM system can slope overload similar to 
DM, the quantizer may be made adaptive such that a variable maximum 
step size (quantization level) occurs to accommodate signals which vary 
greatly in level (and therefore slope). 

4.15 SUMMARY AND DISCUSSION 

The principal theme of this chapter is the description of binary digicom 
systems that use baseband waveforms for channel transmission. Initially, 
the general optimum binary system is determined for a channel having 
white Gaussian noise and an infinite bandwidth. The correlation form of 
optimum receiver is shown in Fig. 4.2-1, where V T is given by (4.2-12). An 
equivalent matched filter form of receiver is shown in Fig. 4.2-2. The 
general probability of bit error in these systems is given by (4.3-5). 

Specializations are developed for the optimum binary system in Secs. 
4.4 and 4.5 for specific unipolar, polar, and Manchester formats, as examples. 
The optimum demodulators (receivers) are shown in Fig. 4.4-3. The average 
probability of a bit error P r is given by (4.4-6), which is plotted in Fig, 
4.4-4. Differentially encoded data are also examined with P e now given by 
(4.4-8). Effects of coding of digital data are discussed. In addition, the 
transmission of analog messages over the PCM digital system is developed 
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in detail. Receiver output signal-to-noise ratio is given by (4.5-21) and 
plotted in Fig. 4.5-2. 

The effects of a bandlimited channel are next discussed (intersymbol 
interference) and ways of either eliminating the effects (Nyquist’s pulse 
shaping) or controlling it (partial response signaling and channel equalization) 
are developed. In the latter category, two optimum systems (duobinary 
and modified duobinary) are discussed in detail. The block diagram for 
both duobinary and modified duobinary systems is shown in Fig. 4.7-l(a) 
where the optimum duobinary filters are defined in (4.7-15) and (4.7-16). 
For the modified duobinary system it is necessary to replace P 2 (co) by P } (co) 
as defined in (4.6-11). Average probability of bit errors in these two systems 
is given by (4.7-7) using (4.7-23). 

An example of a multi-level baseband system is included in Sec. 4.9, 
where M-ary PAM is discussed. Average probability of a word (symbol) 
error is found in (4.9-18) and plotted in Fig. 4.9-2. When compared with 
a binary system having the same word error probability, the M-ary system 
is found to require more average power; this fact is true when the systems 
have either the same information rate or have the same bandwidth. The 
main advantage of M-ary PAM is its smaller channel bandwidth required 
to transmit the same information rate (at same word error probability) as 
the binary system, its main disadvantage is the extra average power required 
to achieve the stated advantage. 

In the remainder of this chapter, Secs. 4.10-4.14, several types of 
modulation systems are described that can convert an analog message directly 
to a digital waveform for channel transmission. These modulators combine 
in one step the functions of sampling, quantization, source encoding, and 
waveform formatting. They do not involve any channel encoding. The 
delta modulation (DM) system is best suited to messages with a power 
spectrum proportional to l/w~, while the delta-sigma modulation (D-SM) 
system is best suited to signals with a constant power spectrum. The former 
spectrum form is a fair approximation for speech and some television signals. 
The adaptive delta modulation (ADM) system of Sec. 4.13 is a variation 
of DM that allows the receiver’s output signal-to-noise ratio to be nearly 
constant over a range of input signal-to-noise ratios. It, therefore, improves 
the dynamic range achievable in the delta modulator. DM, D-SM, and 
ADM modulators all use a simple 1-bit encoding procedure (an error in a 
feedback loop is actually encoded). The differential pulse code modulation 
(DPCM) system of Sec. 4.14 is a variation of DM that uses a multi-bit 
encoding procedure. 

Final receiver output noise powers in the DM, D-SM, ADM, and 
DPCM systems all contain component powers mainly of three types: (1) a 
granular noise power that is always present (it is due to the encoder’s 
quantization noise), (2) another noise that is always present, called receiver 
noise (due to channel noise), and (3) a power component due to slope 
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overload noise; this noise occurs only in time intervals where the transmitter s 
encoder is not capable of following rapid changes in the message’s amplitude 
(above those for which the encoder was designed). Although basic procedures 
are given to compute all these noise components, only certain cases are 
given as examples. Output signal power-to-granular noise power ratio for 
DM, D-SM, and DPCM systems are given by (4.11-11), (4.12-7), and (4.14- 
9), respectively. Signal-to-noise power ratios based on output noise powers 
being the sums of granular and slope overload noise powers are given by 
(4.11-19), (4.13-7), and (4.14-11) for DM, ADM, and DPCM systems, re- 
spectively. Finally, the output ratio of signal powers to the sum of granular 
and receiver noise powers for DM and D-SM systems are, respectively, 
given by (4.11-31) and (4.12-13). Broadly speaking, the adaptive systems 
are preferred for their increased dynamic range while multi-bit encoding 
(DPCM) leads to improved performance. Thus recent emphasis in practical 
systems has been toward adaptive versions of DPCM. 

Finally, it is noted that some recent books are recommended for 
additional reading on the topics of this chapter [56-58]. 
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4 - 2 . Explain in words why (4.1-16) is true. 

4 - 3 . An optimum binary system uses a unipolar-NRZ format with rectangular 
pulses. Pulse amplitude at the receiver is 1.5 V. Pulse duration is 2 >s, and 
white channel noise density is JfJ2 = 5(1(T 7 ) W/Hz. If optimum threshold 
V r , as given by (4.2-12), is 3(10' 6 ) J, find message probabilities P, and P 2 . 

4 - 4 . Show that (4.2-17) is the Fourier transform of (4.2-14). 

* 4-5 A certain binary system seeks to generate s 2 (t) by switching a continuously 
running oscillator 3 cos (a> 0 t) to the channel for each binary 1. For a binary 
0 sx(t) = 0 If bit intervals of duration T b are defined such that one starts 
a u = 0, determine how a> 0 must be related to T„ if the system is to be 
optimum. 

4 - 6 . A binary transmitter for equally probable messages uses the waveform 


0 « t « T» 


elsewhere 


(a single pulse) when a binary 0 occurs, and 
[5 sin 2 (<D(,f), 0 


(5 sin ! (n)(,r), 0 « t T b 

Sl ^ [o, elsewhere 

for a binary 1. (a) Find the energies E, and E 2 in these waveforms, (b) Find 
y . (c) Find the optimum receiver output levels V, and V 2 . (d) What value 
of jV 0 for the channel will give a minimum peak output signal power (square 
of signal level at sampler at sample time) to average noise power ratio of 
1000 (or 30 dB)? 

4 - 7 . Use Schwarz’s inequality to show that y of (4.2-23) satisfies - 1 ^ y — 

4 - 8 . Use (4.2-24) through (4.2-26) and show that the optimum binary system’s 
threshold of (4.2-12) can be written as 

V, + V, o - 1 ln(P,/P 2 ) 

v r = -V- + -^T- 

4 - 9 . In an optimum binary system V, = - 1 V, V 2 = 5 V, V r = 1.9 V, and <r 0 - 
0.5 V. (a) Find message probabilities Pi and P 2 . (b) Find average error 
probability P t . (Hint: Use results of Prob. 4-8.) 

4 - 10 . In an optimum binary digital system the two possible transmitted waveforms 
occur with equal probabilities and have equal energies. The ratio of average 
energy per bit to W 0 is 10.4 and the system’s average bit error probability 
is 1.18(10' 4 ). Find y for the two waveforms. 

4 - 11 . What would the average probability of a bit error become for the system of 
Prob. 4-10 if the transmitted waveforms were adjusted to be antipodal (energies 
are kept the same)? 

4 - 12 . An optimum unipolar-NRZ binary system transmits 12-V ^angular pulses 
of duration 1 ms over a channel for which = 4(10 ) W/Hz. Messages 
occur with probabilities P, = P(m,) = 0.9 and P 2 = P(m 2 ) = 0.1. (a) Find 
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optimum receiver threshold V T . (b) Find the average transmitted pulse’s 
energy, (c) Find e. (d) Find average bit error probability P,. 

4 - 13 . Work Prob. 4-12 if the system uses a polar-NRZ format. 

4 - 14 . A binary system has equally probable messages and uses a Manchester 
waveform format where e = 20. The binary data are differentially encoded 
at the modulator prior to waveform formatting, (a) Find the receiver de- 
modulator’s average bit error probability, (b) What is the probability of a 
bit error after differential decoding? (c) What is the receiver’s optimum 
threshold? 

4-15. In a differential decoder for binary systems the decoded output is correct if 
the current and earlier adjacent bits are both in error. Explain why. 

4 - 16 . Use (4.4-11) to determine the smallest channel bandwidth that will guarantee 
that the decision level D is within 99% of the value it would have if channel 
bandwidth were infinite. 

4 - 17 . A binary PCM system uses a 4-bit natural binary source encoder and no 
channel encoding. It has an average bit error probability of 0.01. (a) What 
is the system’s word error probability? (b) If enough additional transmitter 
power is installed to reduce bit error probability to 0.001 what is the word 
error probability? 

4 - 18 . A binary polar-NRZ PCM system can transmit either 4-bit uncoded words 
during a time interval T or it can transmit a double-error correcting (10, 4) 
code during the same time T. In each case the peak transmitted powers and 
channel noise densities are the same, (a) If P, = 2.339(10~ 3 ) in the uncoded 
case, find the word error probabilities for both coded and uncoded trans- 
missions. (b) Which of the two systems would you use? 

4 - 19 . Work Prob. 4-18 if P r = 1 . 18(10“ 4 ) in the uncoded case. 

4 - 20 . In writing (4.5-2) series terms above the first were assumed negligible and 
(1 ~ PiY ' 1 s * 1 in the first term, (a) Assume a (7, 4) single-error correcting 
code and find an expression in terms of P, for the ratio of the second term 
(first neglected term) to the exact first term, (b) Evaluate your expression 
forP, = 10 ',10 2 , and 10 '. (c) What can you conclude about the approximate 
formula’s accuracy from your results? 

4-21. In a polar-NRZ binary PCM system that uses a unif orm quantizer with an 
analog message for which_|/(r)L* = 0.7 V and f\t) = 0.01 W, the output 
signal-to-quantization noise power ratio is 256.8(10 3 ) (or 54.1 dB) when the 
message power is 6 dB below its reference level. Channel noise is negligible, 
(a) What is the message’s crest factor? (b) How many levels does the 
quantizer implement? (c) What is the quantizer’s step size? 

4 - 22 . A message having symmetrical fluctuations, zero mean, and a crest factor 
of 3.4 is uniformly quantized in a PCM system to quantum levels ±0.005 
V, ±0.015 V, ±0.025 V, ..., ±1.275 V. (a) What maximum signal-to- 
quantization noise power ratio is possible if no amplitude overload is to 
occur? (b) Repeat part (a) if the quantizer’s input is a sinusoidal message 
instead, (c) For the sinusoidal message let the peak amplitude be 1.6 V. 
What fraction of time will amplitude overload occur? 
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4-23. Follow the procedures of the text to show that expressions analogous to 
(4.5-16), (4.5-17), and (4.5-21) for unipolar PCM with equally probable messages 

are 



unipolar-NRZ PCM 


S 0 \ (Sq/N<,) pcm 

^o) pcm ” 1 + 2 2Nt+1 erfc[VA 2 r ( ,/4W 0 ]’ 


unipolar-NRZ PCM 



unipolar-NRZ PCM. 


4-24. If channel noise in the PCM system of Prob. 4-21 is not negligible and 
(3 ,/M) pcm = 18.0, find (SJN,) pcm. Justify that your result agrees with that 
found from Fig. 4.5-2 by suitable scaling. 

4-25. Explain why binary polar PCM uses less average power than unipolar PCM 
when the two have the same peak-to-peak amplitude separation between 
their pulse levels. Assume equally probable messages. 

4-26. Two binary PCM systems, one unipolar and the other polar, are to produce 
the same average bit error probability when using the same channel and 
having the same bit durations. Find the necessary relationship between 
transmitted pulse amplitudes in the two systems. Assume equally probable 
messages. 

4-27. Use results from Prob. 4-23 to find an expression for the 1-dB threshold 
signal-to-noise ratio in a unipolar PCM system with equally probable messages. 
Plot your results versus N b for 2 « N b 20. [Hint: Calculate N b for various 

(S f /iV,)pcM.rt'] 

4-28. A single analog message is transmitted by a binary polar-NRZ PCM system 
with a uniform quantizer. The message has a crest factor of V 2 and ls 
sampled every 50 p.s, which is its Nyquist period, (a) What is the largest 
number of bits per word that can be used that will guarantee performance 
at or above threshold for an input peak signal-to-noise ratio of 12 diU 
(b) What will be the maximum output signal-to-noise ratio? (c) What is l b - 

4-29. Inverse Fourier transform (4.6-4) to show that (4.6-5) is true. 

4-30. (a) Unit-amplitude pulses of the form shown in Fig. 4.6-2 for W,/W = 0 are 
used to transmit bits in a polar PCM system. Sketch the waveform for a 
bit sequence ...11010011100.... Use at least one sidelobe for each pulse in 
the sketch. Assume T b = n/W. (b) Repeat part (a) except use the waveform 
for W,/W = 1. 

4-31. Show that a PCM waveform comprised of pulses of the form of Fig. 4.6-. 
with W\/W = 0 can be passed through a filter matched to the pulse shape 
(white receiver noise) without causing intersymbol interference in samples 
taken every bit interval. 

★4-32. Let a PCM waveform comprised of pulses of the form of Fig. 4.6-2 with 
W,/VV = 1.0 be applied to a receiver filter matched to the shape ot the 
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pulses (white channel noise). Show that samples of the filter’s output taken 
every bit interval do have intersymbol interference. However, show also 
that interference occurs only between adjacent bit intervals. 

4-33. (a) Use pulses of the form of (4.6-8) and sketch the polar PCM waveform 
for the digit sequence . .11010011100. ... (b) Verify that samples of the waveform 
taken at the end of each bit interval give the correct sequence when decoded 
according to the logic given in the text (no precoding). 

4-34. Work Prob. 4-33 for the waveform of (4.6-10). The waveform is modified 
duobinary. 

4-35. Show that p 3 (r), as given in Example 4.6-1, is correct; that is, supply the 
missing steps in the development. 

4-36. The channel response in a binary system has a format made up from Nyquist 
(sampling) pulses defined by 


where co b = 2tt/ T b and T b is the bit interval’s duration. The formatted 
waveform is free of intersymbol interference. If the channel has distortion 
such that its response pulses are characterized by the spectrum 


H,(oj) = H„(ai)i 1 + 2 [a n cos(noiT b ) + jb„sin(na>T b )] }, 


where a„ and b„ are real constants, find the response s c (t) in terms of s 0 (r). 
Show that s c (t) contains intersymbol interference and determine which sample 
times (at multiples of T b ) contain the interference. Discuss how the constants 
a n and b„ affect the interference. 

4-37. A typical pulse p c (t) emerging from the channel of a binary system has 
sample values p c (-2T b ) = -0.05p c *(0), p c (-T„ ) = O.l/UO), p c (0) = 0.9 
Pc/i(0). Pc(T b ) = — 0. 15p r/l (0), and p c (2T b ) = 0.02p cfl (0). p c (t) is zero at all 
other possible sample times. Here p ch ( 0) is the maximum amplitude of the 
desired channel response. Find the coefficients required in a 3-tap channel 
equalizer. Find the values of the equalized channel response waveform at 
all possible sample times. 

4-38. From (4.7-5a), threshold in an optimum duobinary system is a function of 
both sample time, peak signal power A 2 , and noise power o$. How large 
must A 2 /<rl be in order that the threshold be within 10% of its limiting value 
that is independent of noise? 

4-39. Use (4.7-5) with (4.7-1) to show that (4.7-6) is true. 

4-40. In an optimum duobinary system the channel has a transfer function that 
can be approximated by 

HcM = 1 + (a >/W ch f 

Noise from the channel can be taken as white while the receiver’s output 
pulse shape is to be determined by the channel and identical filters at transmitter 
and receiver. If the system bit rate is a> b /2v, sketch the required receiver 
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transfer function’s magnitude squared, |// r (cd)| 2 , for W c Ju b = 0.25, 0.5, 1.0, 
and 2.0. 

■*4-41. An optimum duobinary system, originally designed for identical transmit and 
receive filters with a distortion-free white noise channel, gave a bit error 
probability of 2.234(10~ 3 ). It is modified to operate with the filters and 
channel defined in Prob. 4-40 with W ck = w b . If bit duration, average transmitted 
power, and channel noise level remain the same, what is the new bit error 
probability? 

4-42. In an optimum duobinary system the transmitted average energy per bit 
divided by JV 0 (which is s) is 1.459(10 7 ). Average bit error probability is 
1.6568(10“ s ). The system operates on a lossy but distortion-free white noise 
channel. What is the gain of the channel? 

4-43. Reduce (4.7-23) for an optimum modified duobinary system and show that 
(4.7-25) applies without change. 

4.44. Work Prob. 4-42 for an optimum modified duobinary system. 

4-45. Expand (4.9-2) and show that (4.9-3) is valid. 

*4-46. When messages are not equally probable in an M-ary PAM system, derive 
expressions for the optimum thresholds that are now functions of noise 
power, message probabilities P(V<) as well as levels 3}. 

4-47. An M-ary PAM system uses M = 32 levels uniformly and symmetrically 
displaced about zero. The largest possible receiver level at a sample time 
is 2.48 V while the average noise power level is oi = 4.3896(10 4 ) W. 

(a) What is the separation between the possible received signal levels? 

(b) What is the system’s average probability of error P w ? 

4-48. For an optimum M-ary PAM receiver that uses a white noise matched filter, 
as defined by (4.9-10), show that the filter’s output levels V, at the sample 
times are equal to the input pulse peak amplitudes A,, (b) Show that the 
output noise power is given by (4.9-14). 

4-49. If N, is defined as the “input” average noise power to the sampler in an 
optimum M-ary PAM receiver, show that word error probability can be 
written as 



where we define 

/sA 4 
m-pam M 

4-50. A large power plant uses a single word-interleaved time-multiplexed line to 
send many control and monitoring signals to a main control console. The 
line has 500 equal-length time slots, each carrying 4-bit natural binary words 
that are Nyquist-rate samples of similar narrowband source messages of 10- 
Hz spectral extent. Word error probability is 1.4862(10' 6 ). The multiplexer s 
transmitter is operating linearly with polar format with pulses at peak power 
capability. All available channel bandwidth is being used. The multiplexer 
is to be modified to handle twice as many similar messages, and the modulation 
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is changed to 16-level PAM (one level per 4-bit word) with its peak power 
level raised to 20 times that of the original system, (a) What is the frame 
duration of the 1000 multiplexed signals? (b) What is the duration of an 
M-ary symbol? (c) What is the M-ary PAM system word error probability? 
(d) Why is it necessary to convert to an M-ary PAM system? 

4-51. In a basic DM system, 8u = 0. 1 V and T s = 20 /t s. (a) Based on examination 
of the message 

/(') = A[1.2 cos(400irt + 0,) + 1.3 cos(600w/ + 0 2 ) 

+ 0.5 cos(1200irt + 0,)], 

where 0,, 0 2 , and 0 3 are phase angles that can have any values, find the 
largest value of A that will not cause slope overload, (b) Which frequency 
is most im portant in overload control for this message? 

4-52. Show that [df(t)/dtf/f\t) = W™ s if is given by the right side of (4.11- 
10). (Hint: Consider the power into and out of a differentiation network.) 

4-53. Using (4. 1 1-10) show that the rms bandwidth for a sinusoidal signal of frequency 
a>f is (Of. 

4-54. A message /(/) has a power spectrum 

9X„).B 

[0, elsewhere, 

where B, W,, and W f are positive constants, (a) Find the power f\t) in the 
message, (b) Find the message’s rms bandwidth, (c) If the message, for 
which slope crest factor is 5, W l /2tt = 100 Hz, and 13}/2 tt = 3.5 kHz, is 
transmitted over a basic DM system with a relative pulse rate ui s /W f = 40, 
what maximum value of (SJN g ) DM is achievable without slope overload? 

4-55. Work Prob. 4-54 for the power spectrum 

^M^rect^J-Brect^). 

4-56. A delta modulation system uses a nearly ideal single integrator that is replaced 
by a double integrator. Determine the improvement in output signal-to- 
granular noise power ratio that can be expected for sampling rates of w, = 

5 W f , 10B}, 2014}, and 4013} when a sinusoidal message is transmitted. 

4-57. By assuming operation above threshold in both PCM and DM systems, show 
that 

(So/N q ) PCM = 77 2 2 Wt /^A 2 (W^V 
(SJN g ) D „ 2 Nl \Kj \ Wf) • 

where N b is the number of pulses in a PCM system sample. Assume Nyquist 
sampling in the PCM case and equal channel bandwidths that are a common 
factor A ch > 1 larger than the minimums. That is, assume W cb = A ch tt/t, 
where r is the largest allowable pulse duration in each system (no guard 
times, NRZ PCM format). 

4-58. Assume sinusoidal modulation and determine how much poorer a DM system 
is than an 8-bit PCM system using the result of Prob. 4-57. 
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By use of (4.11-22) show that the (optimum) values of x which define the 
maximum points in Fig. 4.11-1 are given by 


—*= 4(^)1 

*opt [N V 


for multiplexing of N similar signals using delta modulation. 

A delta modulation system is designed to transmit three similar messages 
by time multiplexing. What is the optimum value of K to be used if W ch /V/ f 
= 81.9? For messages characterized by W f /W mi = 2, what optimum output 
signal-to-noise ratio can be achieved? Assume equalities in (4.10-6) apply. 
Work Prob. 4-60 for W c JW f - 30.13. 

By using (4.11-32) show that the threshold value (1-dB definition) of input 
peak signal-to-noise ratio in DM is 


= 2-i erfc~ 


A single-integrator delta modulation system is to be optimally designed (Prob. 
4-59) for transmitting a single message. If Wj,/W/ = 27 for the system, what 
must Wy/W be for the message in order to achieve a 30-dB output signal- 
to-quantization noise power ratio? Assume the minimum channel bandwidth. 
For the system of Prob. 4-63, at what input peak signal-to-noise power ratio 
does threshold occur if W L /W f = 0.05? 

For a sinusoidal message of frequency ay show that, for equal sampling rates, 
a delta-sigma modulation system’s performance can be no larger than three 
times that of a delta modulation system, assuming both use a single integrator. 
That is, show that 

W«/d-sm W \N iJdm \Ng/m 

, Determine and plot (in decibels) how much poorer a practical D-SM system 
performs when it uses a simple lowpass filter of 3-dB bandwidth ay as 
compared to a system with an ideal integrator. Plot versus ay/W/ s 1 when 
the message is a sinusoid of frequency ay = W f . 

, Show that the threshold input peak signal-to-noise ratio (1-dB definition) in 
a delta-sigma modulation system is given by 


= 2( erfc 


mMnNrW} 


In a D-SM system a number of similar messages are time-multiplexed. 
Each message has a crest factor of 3, spectral extent of 3 kHz, and power 
level equal to its reference level. The system sample rate per message is 
64 kHz and channel pulse duration is J of a time slot duration, (a) Find 
(SJN,)osu. (b) Calculate and determine if this system is operating 

above or below threshold when (5 ,/N,) d . S m = 8 at the receiver. Assume 
negligible overload noise. 

What maximum dynamic range improvement can be achieved in an adaptive 
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delta modulation system if the step size can be reduced from its maximum 
by a factor of 20? What is the improvement for factors of 40, 80, and 160? 
4-70. Repeat the computations of the text that lead to Fig. 4.13-2 except assume 
step size is given by 


So 

So, 


Soo + L _ Soo\ f\t) 
So, \, So,/ f)(t) 

1, 


f\t) af r (t) 

J J U)>fW). 


Plot curves only for/ 2 (/) =s f\i) so that the slope overload term of (4.13-9) 
can be ignored. 

4-71. Use (4.14-9) and develop a similar expression for a DM system to show that 
(4.14-10) is true if the two systems have equal channel bandwidths, values 
of K, and use the same message. Assume both systems use the largest 
allowable pulse durations. 

4-72. Performance of a DM system when a particular message is used is (SJN g ) DM 
= 6310.0 (or 38.0 dB). When the same message is sent over a DPCM system 
with the same channel bandwidth and value of K as the DM system, its 
performance is (S 0 /N g )nrcM = 400,691.2. How many bits does the DPCM 
system use per sample? 

4-73. By using (4.14-11) show that the value of x which optimizes (SJN^^ for 
a given ratio a i s /W f is given by 



Chapter 5 


Bandpass Binary 
Digital Systems 

5.0 INTRODUCTION 

Communication systems often involve modulation of a earner, which results, 
of course, in a bandpass waveform. Radio and television signals are good 
examples involving analog messages. A good example where the message 
is digital is the modem, a device used to connect a remote computer terminal 
to the main computer. This modulation-demodulation apparatus modulates 
a carrier with the terminal’s data stream for transmission to the computer 
(often over telephone lines) and recovers the data stream sent by the computer 
via a similar modulation. 

In this chapter we shall examine several methods by which carriers 
are modulated by digital data. Attention is restricted only to binary systems. 
Various M-ary bandpass digital systems are discussed in Chap. 6. Initially 
we consider coherent systems. Some reflection on the reader’s part will 
show that the theoretical developments of Secs. 4. 1-4.3 apply also to coherent 
bandpass systems, and we shall make use of these facts through reference 
to the earlier work. Those readers who may have omitted the earlier theory 
(as indicated in Sec. 4.0) can still profit from the results to be given but 
may wish to review the theory if a more detailed understanding is needed 
of how the results are achieved. 


5.1 OPTIMUM COHERENT BANDPASS SYSTEMS 

As noted above, the developments of Secs. 4. 1-4.3 for optimum binary 
systems also apply to bandpass systems, although they were applied only 
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to baseband systems in Chap. 4. In this section we summarize the earlier 
work for convenient reference. 

For two possible messages, labeled m t and m 2 that have probabilities 
P i = F(wt]) and P 2 = P(m 2 ), respectively, one of two possible waveforms, 
s,(t) or s 2 (t), is transmitted in a given symbol (bit) interval of duration T b . 
Which is transmitted depends, respectively, on which message, m, or m 2 , 
occurs in the symbol (bit) interval. The waveforms are arbitrary except 
they are assumed real and nonzero only in their symbol interval. They are 
transmitted over an infinite-bandwidth channel having white noise, njt), 
with two-sided power density NJ2 at the receiver. The composite waveform 
at the receiver becomes 


= HiW + ««•('), m { sent .. , . 

1 \s 2 (r) + njt), m 2 sent. 

The optimum receiver is that which decides m 2 was transmitted if its 
probability of being sent [given the observed receiver input waveform r(t) 
occurred] is larger than the probability that m , was sent (given the same 
observed waveform). The optimum receiver makes this decision during 
each and every symbol interval by performing the following test: 


(5.1-1) 


If f Kt)[s 2 (t) - s,(t)] dt > V T , choose m 2 , 

* / ° (5.1-2) 

otherwise choose m , . 

Since the exact forms of j,(r) and s 2 (t) are known to the receiver, it is called 
coherent .t The receiver does not know which waveform is being received 
in any given symbol interval, however. 

In (5.1-2) V T is the optimum threshold that separates the regions of 
decision; it is given by 

^2 ~ El Jf 0 , ( Pl\ 

v, “ 2 + T ”\pj' (5 -'' 3) 

where the energies in the received waveforms are 


choose m 2 , 


*-r 

- / o 


Sf(t) dt. 


i = 1 and 2. 


(5.1-4) 


The structure of the optimum coherent receiver follows the imple- 
mentation of the left side of (5.1-2) as illustrated in Fig. 4.2-1 of Chap. 4. 
For convenience of reference the figure is reproduced here as Fig. 5. l-l(a). 
An equivalent matched filter form is shown in (b) as taken from Fig. 
4.2-2. 

t For simplicity of discussion we assume an idealized channel that does not act to 
cause the received waveforms to differ from those transmitted. Practical channels alter the 
transmitted signal’s phase and time of arrival. A coherent system must also compensate for 
these effects through carrier and bit-timing synchronization. In this book we assume, for the 
most part, that synchronization has been accomplished, so our signal model is valid. 
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Mr) 


(a) 


D >0, 
D = 0, 
D <0, 


Choose m 2 
Random Choice 
Choose m l 



Choose m 2 
Random Choice 
Choose m 1 


(b) 

Figure 5.1-1. Optimum coherent binary receivers, (a) Correlator form and (b) 
matched filter form. 


Output Signal Levels and Noise Power 


Let r„{T b ) denote the receiver’s computation of the left side of (5.1-2) 
and let sJT b ) and n a {T h ) represent, respectively, the output signal and noise 
at the symbol decision time (end of interval); then 

r 0 (T b ) = s 0 (T b ) + n 0 (T b ). (5.1-5) 

In Chap. 4 we found that 

f V, = y\fE x E 2 - £,, m, sent 

(5.1-6) 

[v 2 = E 2 - y\jE t E 2 , m 2 sent 


where 


y = 


r n 

I sft)s 2 (t) dt 


1 


(5.1-7) 
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represents the (normalized) correlation between the two waveforms j^/) 
and s 2 (t). Output average noise power, denoted by <j 2 „, is 


(5.1-8) 


°-o = n„\T b ) = (yJ[£j + Ei - 2y\/E t E 2 ]. 


Probability of Error 

In general, (4.3-5) gives the average probability of bit errors, denoted 
by P e . However, in many practical situations the simplifying assumption 
P\ - P 2 - i can be made, so that 


P e = - erfcf / E 2 + E t - 2y^E t E 2 ~ 

2 LV 4^0 

If, in addition, energies are equal so E = E, = E 2 , then 

= \ erfcf /E I 

2 [V 2 Jf 0 J 


(5.1-9) 


(5.1-10) 


Finally, for antipodal signals where j,(/) = -s 2 (t), we have y = - 1 and 
(5.1-10) reduces to 


(5.1-11) 


p e = r erfc I— 


In the next few sections we consider several specific types of coherent 
bandpass systems for which the above optimum results apply. 


5.2 COHERENT AMPLITUDE SHIFT KEYING 

In binary amplitude shift keying (ASK), the amplitude of a carrier is switched 
between two levels, usually the extremes of full on and full off. As a result, 
ASK is sometimes referred to as on-off keying (OOK). The on condition 
might typically correspond to a code 1, whereas off corresponds to a code 
0. In this section we examine the OOK version of ASK assuming carrier 
pulses of rectangular envelope with no spacing between pulses and equal- 
probability messages (/’, — P 2 - J). 


System Implementations 


Figure 5.2-l(a) depicts an optimum binary ASK system (OOK variety) 
as derived from Fig. 5.1-l(a). The transmitter consists of a stream of data 
digits that modulates a carrier to generate the signals 


•^askM 


S|M = 0, 

s 2 (t) = A cos(a > 0 t + d 0 ), 


0 « t =s T b , binary 0 ,, , 

0 =£ t s= T b , binary 1. ( l> 
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Figure 5.2>1. Block diagrams of (a) an optimum coherent ASK system, (b) an equivalent 
receiver, and (c) a matched filter receiver equivalent to that in (b). 


Coherent Amplitude Shift Keying 


The energy in s 2 (t) over one interval is 


(5.2-2) 


whereas the average energy per bit divided by twice the channel noise 
density is 


A 2 T b 1 1\ 1 A% 


2 ' 2 + ° ' 2/Ji 0 4^o ' 


(5.2-3) 


The receiver integrates the product iit)s 2 (t) over each symbol interval. At 
the end of each interyal, a sample is taken of the difference between the 
integrator’s output and the threshold level V T . If the difference D is positive, 
the decision is that m 2 was transmitted in the interval; if £> < 0 the decision 
is in favor of m u but D = 0 leads to an arbitrary (random) choice of m, 
or m 2 (ideally, this event has zero probability of happening). 

The receiver of Fig. 5.2-l(b) is clearly equivalent to that of (a). From 
earlier work, the product involving A followed by the integrator can be 
replaced by a filter matched to the envelope of s 2 (t), as shown in (c); this 
form of receiver can be viewed as a matched form of coherent detector. 
The required filter transfer function from (4.2-17) is (also see Prob. 5-7). 

H r (o>) = AT b e ->V 2 Sate) . (5.2. 4) 


In the general theory leading to the system of Fig. 5.2-l(a), it was 
tacitly assumed that s 2 (t) was the same function regardless of what interval 
m 2 occurred. This condition is guaranteed if the coherent carrier’s frequency 
cuo/ 277 is a multiple of the bit rate \/T h , regardless of the phase angle 8 0 . 
However, the theory really requires only that the receiver know s 2 (t) in 
the interval it is used [s 2 (0 can vary from interval to interval as long as 
the receiver variations follow]. These facts mean that, if the local carrier 
is coherent, we only require wJ2n to be a multiple of half the bit rate. 
Even this condition can be ignored if (o 0 /2tt » l/T b and the local carrier 
is coherent. In the last case the receiver must still generate a locally 
coherent carrier, but the transmitter frequency can be arbitrary so long as 
it is much larger than the bit rate. 

ASK Noise Performance 

Since j,(/) = 0, E, = 0 and y = 0. From (5.1-9) error probability is 

r - - GHVjI] ■ (DHViD- (5 - 2 - s 

In terms of e, an alternative form is 


(5.2-6) 
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This function is plotted in Fig. 5.12-1 for comparison with other systems. 
Another useful form for P e results from defining peak signal power and 
average noise power at the sampler’s input at the sample time by 


5,- = (V 2 - V T ) 2 = 


(5.2-7) 


N, = ^ = 


respectively. We have 


where 


o 1 USA 

P* = ,erfc /- — 

2 y 2 \N ij ASK 


- 

ASK 


(V 2 - V T f A% 


(5.2-8) 


(5.2-9) 


(5.2-10) 


Example 5.2-1 

A binary ASK system for equally probable messages uses 100-p.s bits and a channel 
for which = 1.338(10' 5 ) W/Hz. We find the peak transmitter pulse amplitude 
that will produce a bit error probability P, = 2.055(1(T S ). From (5.2-6) erfc 
[Ve/2] = 4.11(10 5 ), which occurs with \Je/ 2 = 2.9 (from Appendix F). Thus 
e = 2(2.9) 2 = 16.82. From (5.2-3) 

A = 2 Vif = 2 V 1 - 338(10 ' 5) if^ = 3 - 0V - 

Note that this system has an input peak signal-to-noise ratio of 


= e = 16.82 (or 12.26 dB). 


Signal Power Spectrum and Bandwidth 

The transmitted ASK signal is equivalent to a carrier cos(co 0 f + 0 O ) 
multiplied by a unipolar pulse format. Since the power spectrum of the 
unipolar pulse stream was found in (3.7-10), we easily use the result of 
Prob. B-54 to obtain the power spectrum of the product: 


5^ask(<v) — (— j [S(&> — o» 0 ) + 8(to + to 0 )] 


^){sa 2 ^ ^ )Tb + Sa 2 


(5.2-11) 


(to + top )T b 
2 


The bandwidth of this spectrum between first nulls on each side of the 
carrier is 4i r/T b = 2w b , twice the bit rate. 
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Local Carrier Generation for Coherent ASK 

The coherent system requires a local carrier having the same phase 
as the incoming carrier wave. The need for generating this local signal is 
a mam disadvantage of the coherent system. Conceptually, all we require 
to generate the local signal is a very narrowband bandpass filter centered 
at frequency a> 0 followed by an amplifier-limiter, as shown in Fig. 5.2-2(a). 
Because the ASK waveform will contain a large amount of carrier, a filter 
can se ect out the carrier and reject information sidebands if its bandwidth 
is small enough. An amplifier-limiter is then used only to provide a fixed- 
level output. Implementation of the method may be difficult in practice 
owing to the need for the very narrow bandwidth filter at large center 
frequencies. A more practical method would be to first mix down to a 
lower frequency, use a filter-amplifier-limiter, and then mix back up to the 
carrier frequency, as illustrated in (b). Still another scheme uses the phase- 
locked loop of (c); it behaves as a narrow filter at center frequency to 0 . 



(b) 



(c) 

Figiire 5.2-2. Local carrier generation methods for ASK. (a) Using narrowband 
niter, (b) narrowband filter wtth mixers for center frequency shifting, and (c) the 
phase-locked loop [1], 


5.3 PHASE SHIFT KEYING 

In binary phase shift keying (PSK) the phase of a carrier is switched 
between two values according to the two possible messages m, and m 2 . 
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The two phases are usually separated by tt radians and this is the only 
case we consider; it is sometimes called phase reversal keying (PRK). 
Phase reversal keying corresponds to the two possible transmitter waveforms 

_ fji(f) = — A cos(o>of + 0o)> 0« t^T b , m, sent, 

■SpskW = +A cos(ct) 0 f + e 0 ), 0 ^t^T b , m 2 sent. 

The transmitted signal is equivalent to a double sideband suppressed-carrier 
amplitude-modulated waveform where the information signal is a digital 
waveform with polar format. 

Optimum System 


Figure 5.3-1 illustrates the block diagram of the optimum PSK system 
for P, = P 2 = i, as derived from Fig. 5. l-l(a). Because of the equal- 
energy condition (£, = E 2 ) and the equal-message-probability assumption, 
threshold is zero from (5.1-3). From (5.3-1) we see that s,(t) and s 2 (t) are 
antipodal, so that y = -1. The output signal levels at the sample time 
derive from (5.1-6): 


s 0 (T b ) 


V, = -2E, = -A 2 T b , 
V 2 = 2 E 2 = A 2 T b , 


m x sent 
m 2 sent. 


(5.3-2) 


From (5.1-8) the output noise power is 

o- 2 = M 0 A 2 T b . (5.3-3) 

It is often assumed in the optimum PSK system that the transmitted 
carrier frequency is a multiple of the bit rate; that is, (o 0 = mai b , where m = 
1, 2, .... This assumption, as in ASK, makes the waveforms Ji(/) and 
s 2 (t) identical in any intervals in which the respective messages m, and 
m 2 occur. However, it can be shown (Prob. 5-16) that the system of Fig. 5.3-1 
remains optimum if ai 0 is a multiple of half the bit rate (cu 0 = mw b /2 for 
m = 1,2,...) provided the receiver’s local carrier remains phase coherent 
with the transmitted carrier. 


Spectral Properties of PSK 

The binary PSK waveform of (5.3-1) can be considered to be a double 
sideband suppressed-carrier amplitude modulated signal (DSB) where the 
message is digital. The power spectrum of such a signal is comprised of 
scaled translations of the message’s power spectrum, given by (3.7-11), to 
frequencies ±co 0 according to Prob. B-54; we have 

^psk(co) = jsa 2 (o)-co 0 )y + Sa 2 |\a; + <o 0 )y j. (5.3-4) 

Bandwidth between first nulls about the carrier is 2<o b . 

The spectrum of (5.3-4) contains no discrete components (impulses) 
at ±cu 0 , which makes it difficult for the receiver to generate a coherent 
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local carrier. This problem is identical to that encountered in DSB systems, 
where Costas loops and squaring circuits are used to generate the local 
carrier. These techniques produce phase ambiguities of it radians, however, 
which must be resolved in the PSK application. The resolution can be 
accomplished by periodically transmitting a known signal for sign syn- 
chronization. Another technique differentially encodes the binary data prior 
to transmission; on reception the demodulated bit stream is then decoded 
[decoder shown in Fig. 3.6-3(c)]. 

Still another approach to local carrier generation is to carrier-modulate 
with phases not quite 180° apart. The transmitted spectrum would then 
contain a small discrete component that could be recovered by the methods 
used in ASK (Fig. 5.2-2). The disadvantage of this technique is an attendant 
increase in bit error probability. 

Error Probability of PSK 



Example 5.3-1 

Suppose we examine a PSK system that uses the same bit duration (100 ps), the 
same channel noise level [-V 0 = 1.338O0' 5 ) W/Hz], and the same peak amplitude 
of transmitter pulses ( A = 3.0 V) as the ASK system of Example 5.2-1. Here 


(3) 2 10~ 4 
2(1. 338) 10' 5 


33.6323. 


Since e is large, we may use the approximation (F-5) for the complementary error 
function to get 


P e = ——= = 1.204(10“ 16 ). 
2\ns 
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When this value of P t is compared to P t = 2.055(1(T 5 ) for the ASK system, we 
find a vast improvement. However, if the PSK system had used the same peak- 
to-peak separation in the transmitted signal as the ASK system, we would have 
obtained equal values of P t . The reader is urged to prove this fact as an exercise. 


5.4 QUADRATURE PSK 

System for Two Message Sources 

Consider two PSK systems, one exactly as shown in Fig. 5.3-1 and a 
second that is identical to the first except that it uses a carrier sin(cu 0 f + 0 O ) 
instead of cos (co 0 t + 0 O ). It can be shown (Prob. 5-22) that the two systems 
operate independently as long as o> 0 is an integral multiple of half the bit 
rate (o> 0 = mco b /2, m = 1,2,...). By combining the two into a single 
equivalent system, we form a means of doubling the rate at which bits are 
transmitted over the channel (on the same carrier). The resulting system 
is called quadrature PSK (QPSK). 

The equivalent transmitted signal is the sum of the two separate 
waveforms t 

■SqpskM = +A cos(cu 0 r + 8 0 ) — (+A)sin(o> 0 / + 0 O ) (5.4-1) 

for 0 ^ t T b . The factor +A of the cos(o> 0 r + 0 O ) term represents the 
data from one source, whereas the + A on sin(co 0 f + 0 O ) represents data 
from the second source. In each case the upper sign goes with a binary 
0, and the lower sign corresponds to a I. It is readily shown that (5.4-1) 
can be written as 

■SqpskW = y/2A cos(oi 0 / + 0 O + 0*), (5.4-2) 

where B k can have values ±7r/4, and ±37 r/4 according to the messages. 
The possible amplitudes and phases (8 k ) of j QPS k(0 are shown as points in 
Fig. 5.4-1 . For example, if the message source modulating the cosine carrier 
is a binary 0 and that of the second source modulating the sine carrier is 
a 1, then the corresponding polar waveform levels are -A and A, respectively; 
the transmitted composite waveform peak amplitude is \J2 A, and the value 
of 0 k is 37 t/ 4. Diagrams like Fig. 5.4-1 are called signal constellations; 
more will be said about these in Chap. 6. 

Figure 5.4-2 illustrates the QPSK system just described. Because the 
channel is actually supporting two PSK signals at the same carrier frequency 
and bandwidth, the spectral bandwidth of QPSK is the same as a single 
PSK system. Average probability of a bit’s being in error in either of the 
two receiver channels is also the same as in PSK. From (5.3-5) we have 


t For convenience we add the negative of the system using the sine carrier. The sign 
can be accounted for in the receiver. 
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Figure 5.4-1. Possible amplitudes and phases of a QPSK waveform. 



Sample 


Figure 5.4-2. A QPSK system having two polar formatted message sources 
sharing the same channel and carrier frequency. 



either message channel, 


(5.4-3) 


where T b is, of course, the bit duration of the message polar format. 


System for Single Message Source 

Although the QPSK system just described assumed separate message 
sources at the same symbol (bit) rate, it is possible to have the system 
work with a single source at twice the bit ratet by using a serial-to-parallel 


t T b now is half T b used before for each of the two sources. 
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converter. The functions performed by the transmitter modulator are shown 
in Fig. 5. 4-3 (a). The converter accepts the binary polar-NRZ formatted 
waveform with bit rate f b = \/T b and converts two bits at a timet into two 
new polar waveforms with “symbol” duration T, where 

T=2T b . (5.4.4) 

Example waveforms are shown in Fig. 5.4-3(b). It is helpful to associate 
polar waveform 1 with the first bit of the 2-bit “words” and polar waveform 
2 with the second bit; this convention was used in the example waveforms. 


Polar Waveform 1 
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The receiver is again two PSK channels in parallel that respond to 
their individual carrier component. Figure 5.4-4 illustrates the receiver. 
Processing channel 1 makes a decision (based on Di) each symbol interval 
of duration T; these decisions correspond to the first bit of the 2-bit original 
message sequence during time 7. Similarly, processing channel 2 is associated 
with the second bit. Since the two channels behave as independent PSK 
channels, their error probabilities are equal and give the bit error probability 
for the overall message data stream (at rate \/T b = 2/T). From (5.3-5) 



Finally, the output data stream (bit rate of \/T b - f b ) is generated in a 
parallel-to-serial converter using decisions made in the processor channels. 


Processor Channel 1 



Processor Channel 2 


Figure 5.4-4. QPSK receiver functions for a single message source. 

On casually comparing (5.4-5) with (5.3-5) it might appear that QPSK 
has lower P e than PSK when both transmit the same message. An example 
will demonstrate that performances are actually the s ame. 

Example 5.4-1 

Let the message bit rate (1/7),) and channel noise levels (MJ2) be the same when 
a fair comparison of bit error probability is to be made between PSK and QPSK 
systems. From (5.3-5) and (5.4-5) the two values of P t will be the same if A /2 in 
the PSK system equals A 2 in the QPSK system. That this is indeed true can be 
seen from (5.3-1) and (5.4-2). The peak amplitude of the PSK channel pulse is A, 
so peak pulse power is A 2 / 2. The channel pulse in QPSK is \/2 A, so peak power 
is A 2 . Clearly, a fair comparison of systems gives no power advantage to either. 
The equal-power condition, therefore, corresponds to equal values of P,. 

From the preceding example it is clear that the transmitted average 
energy per bit is (\/2 A) 2 T b /2 = A 2 7 b , so (5.4-5) can be written as 

P e = - erfc (V e) 


(5.4-6) 
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where e, as usual, is the ratio of average transmitted energy per bit to jV 0 : 



The behavior of (5.4-6) is shown in Fig. 5.12-1. 

Other Quadrature Modulations: QAM and OQPSK 

In the QPSK systems of Figures 5.4-2 and 5.4-3(a), modulated carriers 
in phase quadrature were combined to form the output waveform. This 
process is called quadrature modulation (sometimes also called quadrature 
multiplexing). In QPSK the amplitudes of the modulating polar waveforms 
and modulator gains are made as nearly equal as possible. The resulting sig- 
nal constellation has points located at the corners of a square (Fig. 5.4-1). 

A generalization of quadrature modulation allows message polar wave- 
forms to have unequal amplitudes. The signal constellation points now fall 
on the comers of a rectangle. The corresponding waveform is called quad- 
rature amplitude modulation (QAM). Sometimes it is also known as un- 
balanced QPSK. Although there are some differences in where filters are 
placed in a practical system to establish spectral shaping [3], QAM and 
QPSK are quite similar. Indeed, the names are often used interchangeably 
in practice (equal-amplitude case). 

The amplitude of a QPSK signal is ideally constant, a desirable property 
in one of its most important applications, a satellite link. Here the signal 
is bandlimited by a bandpass filter to conform to out-of-band spectra] emission 
standards. The filtering degrades the constant-amplitude property of the 
QPSK signal. At the satellite repeater amplitude limiting restores this property 
but increases the out-of-band spectral levels again [4]. A modified form of 
QPSK, called offset QPSK (OQPSK), has been devised that is less susceptible 
to these effects. 

The system of Fig. 5.4-3(a) becomes an OQPSK modulator if a delay 
of 1 bit ( T b ) is added prior to the modulator in the line carrying polar 
waveform 2. The effect is to move the polar waveform 2 of (b) right by 
1-bit duration T b . The reader can no doubt recognize that the largest phase 
change is ± tt/2 per bit interval T b , whereas the largest was ± v per symbol 
interval (2 bits) in QPSK. Thus phase changes are smaller and more frequent 
in OQPSK, which leads to smaller amplitude fluctuations following band- 
limiting. Even after amplitude limiting to remove fluctuations, tests [4, 5] 
indicate that the OQPSK spectrum remains nearly unchanged so that out- 
of-band spectral levels remain near their desired levels. 

The receiver for OQPSK is a modified version of the QPSK receiver 
of Fig. 5.4-4. The lower integrator still integrates over each symbol interval 
T, but its intervals are delayed by T h relative to the upper processor channel. 
Decision values D, then occur at times T b + T, T b + 2 7, T b + 37, and 
so on, whereas D 2 occurs at times IT, 37, 47, and so on. Addition of a 
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delay T b will synchronize values of D ] with values of D 2 for use in the 
converter. Since staggering bit streams does not change the properties of 
the quadrature modulation process, OQPSK has the same average probability 
of bit error as QPSK. Even in the presence of reference carriers with phase 
jitter, OQPSK seems to perform better than QPSK [4, 6]. Furthermore, 
the offset of T b in the two bit streams in OQPSK has been shown to be 
optimum in terms of phase jitter immunity in the presence of additive 
Gaussian noise [4, 7], 

Carrier Recovery in QAM and QPSK 

Since QAM and QPSK are coherent systems, their receivers require 
locally generated coherent carriers. One method of generating these carriers 
uses the squaring loop shown in Fig. 5.4-5(a). It applies mainly to QAM, 
where it is used when more than 73% of transmitted power is due to one 




Figure 5.4-5. Methods of generating local carriers in QAM or QPSK. (a) Squaring 
loop and (b) an equivalent network not requiring a phase-locked loop. 
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of the quadrature components [8], The phase-locked loop acts as a combined 
frequency divider and narrowband filter at frequency 2w 0 . In some applications 
the acquisition (lock-up) time of the loop is too long and the equivalent 
circuit of (b) is useful. It uses a passive narrowband filter followed by a 
divider. 

When less than 73% of the total power is in the largest-power quadrature 
component in QAM, or when carrier recovery in QPSK is required modified 
forms of the networks of Fig. 5.4-5 should be used [8], Modifications 
involve replacing squares by fourth-power devices, + 2 and x2 networks 
by -i-4 and x4 devices, and centering filters at 4cu 0 . (See Prob. 5-26.) For 
constant-envelope infinite bandwidth input, a pure reference carrier can be 
recovered. However, in practice some bandlimiting occurs and the carrier 
contains some modulation-dependent interference called pattern noise [9], 
These fourth-power nonlinear networks contain phase ambiguities at multiples 
of 7 t/ 2 and some provision must be made to correct for their effects. 

Another method of carrier recovery in QAM and QPSK uses a modified 
Costas loop, as illustrated in Fig. 5.4-6 [3, 10-12], The network has phase 
ambiguities at multiples of n/2. (See Prob. 5-27.) Performance of Costas 
loops in noise can be optimized [13-14] by special selection of the filters 
at the carrier product device outputs or by using integrate-and-dump filters. 
The disadvantage of the latter approach is the need for symbol timing of 
the integrators. 

Other methods of carrier recovery exist. We mention only one, called 
a remodulator [9, 15]. The received signal is first demodulated using quad- 
rature versions of the local carrier, much as is done in the first stage of 
the receiver. The two message outputs are each lowpass filtered, sym- 
metrically limited, and used to multiply (remodulate) quadrature versions 
of the input signal (which is delayed slightly at these multipliers to compensate 
for network delays). A summation of the product outputs (with appropriate 
sign) is bandpass filtered and limited to remove amplitude fluctuations. The 
final signal is the local carrier that was used in the start of the description. 



Figure 5.4-6. Costas loop for local carrier recovery in QAM or QPSK systems. 
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When used with an interval of unmodulated carrier (called a preamble) 
for initial alignment, the remodulator does not contain a phase ambiguity 
[9, p. 140]. 

The reader is also referred to a special IEEE publication on syn- 
chronization topics [16]. 


5.5 COHERENT FREQUENCY SHIFT KEYING 

In binary frequency shift keying (FSK), the frequency of the transmitted 
signal is switched between two values 

oj[ = a> 0 — Aw (5.5-1) 

w 2 = coq -f- A w (5.5-2) 

according to messages m, and tn 2 (for binary 0 and 1), respectively. Here 
w 0 represents a nominal carrier frequency, and Aw represents a frequency 
deviation due to message modulation. Two principal versions of FSK can 
be defined based on how the frequency variations are imparted into the 
transmitted waveform. We consider both versions. 


FSK Using Independent Oscillators 


In one simple system FSK is generated by switching between two 
independent oscillators according to which message occurs, as shown in 
Fig. 5.5-1. In general, this form of FSK involves phase discontinuities at 
switching times and is not often implemented in practice. It is of historical 
interest, however, and does form a basis of comparison for other FSK 
systems. Thus although we consider this form of FSK, often called dis- 
continuous FSK, our treatment is brief. 

Transmitted waveforms involved are 

t \ _ = -A cos(a),t + 0i), 0 « t =£ T b , m x sent , ,, 

•W') - = A cos (w 2 f + 0 2 ), 0 € t « T„, m 2 sent. 

Here 0, and 0 2 are arbitrary phase angles and the transmitted energy per 
bit is 



(5.5-4) 


so 


E, _ E 2 _ A 2 T„ 
Jf 0 2Jf 0 ' 


(5.5-5) 


Receiver structure, received signal and noise levels, error probability, 
and threshold all derive from the work in Sec. 5.1. For simplicity, we 
assume equally probable messages [Elm,) = i and P(m 2 ) = 4], even though 




i 

i 

4 



o o' o' 
A ii V 
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the more general case could be stated. For convenience, we also assume 
«>, and cu 2 are positive integer multiples of the bit rate u> b = 2ir/T b . This 
condition guarantees that the forms of s,(t) and s 2 (t) will not change as a 
function of the bit interval in which they are to be transmitted. With these 
assumptions, y is readily computed from (5.1-7) as follows: 

1 r Tt 

7 = - | S : (t)s 2 (t) dt 


-vmJ. 

2 r 

= — I cos(o)it + d l )cos(oj 2 t + d 2 ) dt 

Tf, -Jo 


(5.5-6) 


= Sa(Ao)r i ,)cos(Acpr i , + 6 2 - d\) 

+ Sa(a) 0 T b )cos((ti 0 T b + d 2 + e t ). 

Since w 2 = M 2 <o b and w, = M x oi h , with M, and M 2 > M i both integers, 
we find from (5.5-1) and (5.5-2) that both A coT b and w 0 T b are integral multiples 
of 77. Hence 

y = 0. (5.5-7) 

This FSK system is called orthogonal because y = 0. 

After having established y, other quantities readily follow. Receiver 
output signal levels and noise power, from (5.1-6) and (5.1-8), are 

° {Th) [V 2 =A 2 T b /2, m 2 sent (5 ' 5 


(5.5-8) 


oi = A^. 


(5.5-9) 


Equation (5.1-3) gives the optimum receiver threshold 

V T = 0. (5.5-10) 

The resulting receiver is shown in its correlation form in Fig. 5.5-1. An 
equivalent form using passive filters is also possible. 

Finally, average bit error probability is 


a e erfcl" /^1-Ierfcf S 
2 [_ V 4 jV 0 J 2 LV2j 


from (5.1-10). This function is plotted in Fig. 5.12-1. In terms of peak 
signal power to average noise power (at the sampler) at the sample time, 
we have 


Si = V] = V\ = 


A 2 T b V 


Ni = <rl = A 2 T b -2 


(5.5-13) 
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Example 5.5-1 

For comparison to the ASK system of Example 5.2-1, let an FSK system use the 
same values of A, T b , and «V 0 . Thus from (5.2-10), the previous example, and 
(5.5-14), 



From (5.5-15), P e - 0.5 erfc[\/l6.82] — 3.41(10 -9 ), which is smaller than P. in the 
ASK system by a factor of 6026.4. 


Example 5.5-1 demonstrates the superiority of FSK over ASK with 
equal peak signal amplitudes. If equal average energy becomes the basis 
of comparison, the two systems perform equally with noise. 

The power density spectrum of the discontinuous FSK waveform is 
known [17, eq. (18)] but has a rather complicated form that depends intimately 
on phases 0, and d 2 . Generally, the power spectrum falls off as the reciprocal 
of the square of frequency for frequencies far from the carrier. It has two 
components, a continuous part arising from switching and a discrete com- 
ponent (with impulses) at the frequencies o> 0 ±Aa> and -o> 0 ±Aco. Half 
the total power is in the continuous part and half is in the discrete components. 
For the special case where 0 t and d 2 are assumed to be zero-mean independent 
random variables, uniform on (0, 2ir), and both o>, and a> 2 are integral 
multiples of the bit rate a> b , the spectrum reduces to 

^fsk(vj) = y(a>) + SP(- o>). (5.5-16) 

where we define 

A 2 77 

^ Q> ) = g — cu 0 — Aw) + S(oj — Wo + Aw)] 

(5.5-17) 

+ Sa 2 — — 6,0 + ] 

L 2 Jj- 

Figure 5.5-2 illustrates the behavior of ^ FSK (w) for frequencies near 
w 0 when (5.5-17) applies and Aw = w*/2. Bandwidth between first nulls 
on either side of the carrier is 3 ai b . 


■ | Sa 2 


(w - w 0 - Aw) T b 
2 
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fsjc( u ) 



u - CJp 


Figure 5.5-2. Power spectrum for discontinuous phase FSK, where 0, and S 2 
are independent uniformly distributed random variables on (0, 2ir). 


Continuous Phase FSK 


Another method of generating FSK is to frequency modulate a single 
oscillator by the message waveform. The resulting waveform is called 
continuous phase FSK (CPFSK). The applicable waveform is now 

_ fjiW = Acos(w 0 r - Amt + 0 O ), 0 =sr=s7;, m, sent 

SfskW \s 2 (t) = A cos(w 0 t + A cot + 0 O ), 0 =£ t^T b , m 2 sent. 

(5.5-18) 

Again, if equal message probabilities are assumed, threshold, integrator 
output signal levels, output noise power, and bit error probabilities result 
from developments of Sec. 5.1: 

F r = 0 (5.5-19) 


sJT b ) = 


(1 - y)A% 


(1 - y)A% 


m i sent 


m 2 sent 
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2 


(5.5-21) 


P e = \ erfc r- . (5.5-22) 

2 1_V 4vV 0 J 

The applicable optimum receiver is identical to that shown in Fig. 5.5-1. 
Of' course, the transmitter is now just an FM oscillator having a polar 
modulating waveform. 

On use of the definition of y, (5.1-7), we find that 

2 r n 

y = — I cos(w 0 t - Aw/ + 0 o )cos(w o / + Ac «/ + 0 O ) dt 

Jb (5.5-23) 

= Sa(2Ati>7' i) ) + Sa(&> 0 r i) )cos(cu 0 7’ i , + 20 o ). 

The second term in (5.5-23) is zero if aj 0 T h is an integral multiple of ir\ 
alternatively, if a) 0 T b » 1 the term is negligible, a condition often true in 
practice. We assume one of these two conditions is true, so that 

y = Sa(2Acu7' i) ) (5.5-24) 

in the remainder of this section. We shall consider two choices of y. 

First, suppose we make the FSK system orthogonal, as resulted with 
independent oscillators. Orthogonality requires y = 0 which, in turn, requires 
2 Ac oT h to be an integral multiple of it in (5.5-24): 


b 1 T 

— , n — 1,2,.... 


(5.5-25) 


Since we are dealing with frequency modulation of peak frequency deviation 
Aw, we expect the bandwidth of the FSK waveform to be smallest when 
Aw is smallest. In other words, a spectrally efficient waveform follows the 
choice n = 1 in (5.5-25). The FSK system with n = 1, where Aw = w t /4, 
has minimum separation between frequencies for orthogonality and is called 
minimum shift keying (MSK). We examine MSK in greater detail in the 
next section. 

As a second choice, we find the value of y that is most negative 
because it leads to the smallest value of P e from (5.5-22). The largest 
negative value of y occurs where 2Aw T b ~ 3w/2 in (5.5-24). It is y = 
0.21, which gives 


= - erfc 
2 


= - erfc 
2 


/ l.2L4 : 7) 

4^ 0 


(5.5-26) 


e 


(5.5-27) 
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The factor 1.21 represents a performance improvement over orthogonal 
FSK of 0.83 dB. 

A popular application of coherent FSK is in keyboard-type computer 
terminal modems that connect the terminal to a main computer, often by 
dial-up telephone lines. After the telephone connection is made by dialing 
the main computer, some modems will allow simultaneous two-way data 
communication, called full-duplex operation. To accomplish full-duplex, 
separate transmit and receive carrier frequencies are used. A half-duplex 
terminal can either transmit (talk) or receive (listen) at any one time, but 
not both. Some modems operate in a simplex mode where they only talk 
or listen. An example of a typical modem will illustrate some of the ideas 
previously discussed. 


Example 5.5-2 

The AT&T model 103 FSK modem can operate full-duplex using transmit and 
receive carrier frequencies of 1170 Hz and 2125 Hz, respectively. t The bit rate 
is 300 bits/s and Aoi/27r = 100 Hz. For this modem <o 0 T b = 2-n-(1170)/300 = 
7.8 tt » 1, so (5.5-24) applies. We calculate 


y 


= Sa(2A<o7 n i ,) = Sa 


4tr(100) 1 _ /4w\ 

300 J S \3 j 


-0.207. 


This value of y is near the optimum value. We find P, from (5.5-22) assuming A - 
0.5 V and Jf 0 = 2.012(1(T 5 ) W/Hz: 

P i rfc r ,/ 

' 2 Lx 300(4)2.012(10 5 ) 

= X - erfch/12.498] - 2.98(10" 7 ). 


Note that this value of P, corresponds to a peak signal-to-noise power ratio at the 
sampler at the sample time of 


SA _Yl_Y\ = (1 - y)A% m 1.207(0.25) = 

vJ FSK ~ crl oj 2#o 300(2)2.012(10- 5 ) 


or 13.98 dB. 


Power Spectrum of CPFSK 

Anderson and Salz [18] and Bennett and Rice [17] have found the 
power density spectrum of the CPFSK waveform of (5.5-18). Because the 
analysis is quite involved, we only summarize the principal findings. 
The power spectrum .Sf CPFS K(w), valid for < cu < can be written 
[17, eq. (48)] as 

^cpfskM = SKa) + 9\-a) (5.5-28) 

t Frequencies apply to one of a pair of matched modems; the second modem’s frequencies 
are 2125 Hz for transmitting and 1170 Hz for receiving. 
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where 


= . (At oT b ) 2 Sa 2 [y ,(&>)] Sa 2 [y 2 (tu)] 

4 {1 + C\ - 2 C a cos[(tu - (UfPTf,]) 

T 

yi(co) = (tu — cu 0 + Aeo)y 
T 

^(tu) = (<t> — <u 0 — Ato)y 


(5.5-29) 

(5.5-30) 

(5.5-31) 


C a = cos(A a>T b ). (5.5-32) 

The spectrum of (5.5-28) applies rather generally. However, there are 
certain special cases that must be separately considered; these are sub- 
sequently discussed. Examples of power spectrum behavior are shown in 
Fig. 5.5-3. The curves were computed from (5.5-28), using (5. 5-29). t For 
A to/to;, not greater than about 0.3, the spectrum is smooth and single-peaked 
at the carrier. As Ato/to* increases in the range 0.3 < A to/ to* < 0.5, the 
curves develop two increasingly sharp and large peaks at frequencies of 
magnitudes |co — co 0 | < u> b /2. For Ato/ co b > 0.5 these peaks move outside 
frequencies o> 0 ± (to*/2) and decrease in sharpness and amplitude as A to/ to* 
increases. These spectrums are all continuous and have no discrete (impulse) 
components. 


Bennett and Rice [17] have shown that three special cases arise in 
determining CPFSK power spectrum. These are summarized in Table 5.5-1 . 
In cases I and II, the power spectrum is given by 

A 2 7T 

cpfsk(^) = — — [8(to - to 0 - Ato) + 8(to - to 0 + Ato) 

S (5.5-33) 


where w v (f) is determined by equations of [17] as referenced in the table. t 
In a similar manner, for case III we have 


•^cpfskIw) - - wf f ) (5.5-34) 

where w u (f) is given by equation (67) in [17], We note that cases I and 

II result in impulses in the power spectrum while the spectrum of cases 

III and IV are entirely continuous. In all cases the spectrum decreases as 
the inverse fourth power of frequency for frequencies remote from the 
carrier, this behavior is typical of coherent phase FSK waveforms using 
rectangular bit pulses. 


t The curves were generated from the term in (5.5-28) involving The term 

involving if(- to) is significant mainly at frequencies near -io 0 when oi 0 » oi t . 
t All our power spectrums are two-sided; those in [17] are one-sided. 
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TABLE 5.5-1. Special Cases in CPFSK Power Spectrums. 


Case 

Defining Conditions for Positive Integers r and / 

Applicable Equations 

I 

Aw/(w t /4) = even integer = 2 r and 
w 0 /(w t /4) ^ even integer = 21 

(53) or (54) in [17] 

II 

A<o/(o)(,/4) = even integer = 2 r and 
«o/(<u t /4) = even integer = 2/ 

(56) in [171 

III 

Aw/(coj,/4) = odd integer = (2r - 1) and 
wo/(g>i,/4) = odd integer = (21 - 1) 

(67) in [17] 

IV 

All other continuous phase cases 

(5.5-28), (5.5-29) 
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5.6 MINIMUM SHIFT KEYING 

As noted earlier, minimum shift keying (MSK) is the name given to CPFSK 
when keying is between two frequencies separated by half the data bit rate. 
These frequencies are co t = co 0 — Ac o and cu 2 = u 0 + Aa>, where ±A&> are 
the frequency deviations caused by the digital data waveform (a polar NRZ 
format with amplitude ± 1). Thus MSK is defined by 

(D b IT 

A(0 = J = 2T b ^ 

where T b is the duration of a typical data bit and co b = 2v/T h . 

One other characteristic of MSK serves to separate it from what we 
called CPFSK in the preceding section. The earlier continuous phase FSK 
system did not fully utilize all the phase information present in the received 
waveform, although it used enough to provide synchronization of the receiver 
to the transmitter through the local oscillator’s phase. By making better 
use of phase information the noise performance of MSK can be better than 
the earlier CPFSK system. 

MSK waveforms can be divided broadly into two categories of im- 
plementation, parallel and serial. Parallel MSK [4, 19-22] involves an 
implementation much like offset quadrature PSK (OQPSK). Serial MSK 
[23-26] has a somewhat simpler implementation than parallel and is sometimes 
preferred in high data rate systems. We subsequently discuss all these 
MSK implementations. However, it is first helpful to develop some properties 
of the general CPFSK waveform that help illustrate the ideas on which 
practical MSK systems are based. 

CPFSK Signal Decomposition 


Let d(t) represent a polar-NRZ waveform representing a binary data 
source. Bits are assumed to occur independently each T b seconds with 
equal probability. We let the sequence d k represent the amplitudes, assumed 
to be +1 for a data 1 and - 1 for a 0, in the various bit intervals indexed 
by k, k = ..., — 1, 0, 1, 2, .... We assume the time origin occurs at the 
start of the zeroth interval ( k = 0). These definitions are illustrated by 
Fig. 5.6-1 (a) . In interval k the frequency of the CPFSK waveform will bet 
w 0 + Aoj = o> 0 + (rr/2 T*) if d k = 1 and co 0 - (n/2T b ) if d k = -1. Thus 
we have 


5 cpfsk(0 — A cos[co 0 t + 0(f)] 


= A cos co 0 t + 6 k + 


kT„ / « (* + 1)7],. 


t We describe only the case where Aw = ir/2T b . Thus the CPFSK waveform described 
is the MSK waveform. 
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(b) 

Figure 5.6-1. (a) Data interval definitions in CPFSK. (b) Trellis and table of 
6 k - 0 O to describe phase history of a CPFSK waveform. 

If we think of 0(0 as the phase due to data modulation at the constant 
frequency d k ir/2T b , then 0* represents the phase at f = 0. Since d k can 
change from interval to interval, 0* can be different for each interval. 

The phase history of 0(0 is best illustrated by a trellis diagram, as 
shown in Fig. 5.6-l(b) for t s* 0. Plotted is 0(0 - 0 O so that the function 
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is zero at t = 0. In the zeroth interval, if d 0 = 1, frequency is larger than 
o>o by Ato = 2w/4T b , so phase increases linearly (dashed line) by an amount 
ko>T b = tt/2 to arrive at the point marked a. If d a = - 1 the solid line 
would lead to point b. From either of these points, the phase would again 
either increase or decrease linearly by another ±tt/2 according to d t being 
± 1 in the interval. From a, the paths lead to points marked by heavy dots 
as c or d. From b, we either arrive at point d (for d, = 1) or c (the open 
dot). However, since phase is a modulo-27r function we have shown only 
four main phase corners (w, tt/2, 0, -tt/2) by heavy dots and auxiliary 
open points b or c to handle the extremes of the 27t interval. By proceeding 
in the same manner in other intervals, the phase history of 0(f) - 0 O is 
seen to be a piecewise linear (continuous) function. The linear segment in 
interval k extrapolated back to t = 0 gives 0* - 0 O . A table of a few 
values of 6 k — 0 O is shown in the table of Fig. 5.6- 1 (b). We observe that 
0* — 0o can only have two values, 0 or rr (modulo 2ir), and its behavior 
is repetitive after four intervals. t 

This phase history is useful in decomposing (5.6-2) into a form useful 
for generation of MSK signals. From trigonometric identities and the facts 
that 0* — 0 O = 0 or ir and d k = ± 1 , we have 


^cpfskW — A cosl o 0 t + 0 O + 0* — 0 O + d k —— 
\ 27), 


= A cos yd k - 0 O + d k — lcos(o) 0 f + 0 O ) 


- A sin|^0* - 0 O + ^—jsinlcoof + 0 O ) 


= A cos (0* - 0 o )cos^— jcos(o) 0 f + 0 O ) 


Here we define 


- A d k cos (0* - 0 o )sin^— jsin(o> 0 f + 0 O ) 

= J|(f) A cos(o) 0 f + 0 O ) - s Q (t) A sin(«V + 0 O ). (5.6-3) 


sM = cos(0* - d 0 )cos[~ 


s g (t) = d k cos(0* - 0 o )sin|j^-j. 


(5.6-4) 


(5.6-5) 


Examination of the paths into and out of node c in the phase trellis of Fig. 
5.6-l(b) shows that 6 k - 0 O does not change at the node. A similar fact 

t Since 0 O is arbitrary, most authors simply assume 0 O = 0 so that the ordinate intercept 
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holds at node d. These facts mean that cos(0 k - 0 O ) does not change sign 
at nodes corresponding to kT b , k even. However, at nodes a and b (at 
kT b , k odd) changes in 6 k - 8 a can occur. All these points show that the 
sign of cos(0* - 0 O ) in (5.6-4) can change only at the zero crossings of 
cos(-irt/2T b ). 

Next, extend the examination to d k cos(6 k - 6 a ). Since d k can change 
at any interval we now find that the sign of d k cos(8 k - 6 0 ) can change at 
kT b , k even, which is now at the zero crossings of sin (irt/2T b ) in (5.6-5). 
Again turning to nodes a and b , we find that d k cos(O k - 0 O ) does not change 
sign (the reader may wish to work through the logic by comparing input 
and output paths). These points show that d k cos(8 k — 0 O ) in (5.6-5) changes 
sign only at the zero crossings of sm(wt/2T b ). 

Our logic has shown that the factors s,(t) and s Q (t) multiplying the 
quadrature carriers in (5.6-3) are constant in sign over successive intervals 
of duration 2 T b . The 2T b intervals of s Q (t) are delayed one bit duration T b 
relative to those of sM because of the relative timing of cos(nt/2T b ) and 
sin(irt/2T b ). Such timing is exactly the type already discussed in offset 
QPSK. Since the signs of s,(t) and s Q (t) are related to the original data 
sequence d k (as well as the alterating signs of the “carriers” at frequency 
Am = 2ir/4T b ), they can be taken as analogous to the modulating data 
waveforms in OQPSK. There is a difference, however, in that the symbol 
waveforms of duration 2 T b in OQPSK were rectangular, whereas in CPFSK 
the symbol waveforms are half-cycles of a sinusoid at frequency Aw/277 = 
1/47V 

In summary, our developments have shown that a CPFSK waveform 
can be decomposed into the sum of two modulated carriers in quadrature 
according to (5.6-3). The characteristics of the data-related modulating 
waveforms s^f) and s e (t) were seen to be closely analogous to OQPSK. 
The antilogy forms the basis on which most MSK systems are developed. 
We shall discuss several forms of MSK in following subsections, considering 
parallel implementations first because they most directly relate to the OQPSK 
analogy. 

Parallel MSK Systems — Type I 

A common MSK modulator that uses the analogy of (5.6-3) with 
OQPSK is shown in Fig. 5.6-2(a). The input data stream d(t) with bit rate 
\/T b is separated into two data streams, d,(t) and d Q (t), each with symbol 
rate 1/2 T b . The separation is exactly as in OQPSK where d,(t) is made 
up from odd-numbered bits in d(t) and d Q (t) is developed from even-numbered 
bits in d(t). Figure 5.6-3 illustrates the waveforms for a data stream having 
values {<4} = {..., -1, -1, 1, 1, -1, 1, 1, -1> 1. 1> — 1. •••} f° r bit 
intervals {..., -2, - 1, 0, ..., 8, ...}. In this modulator d,(t) and d Q (t) have 
replaced cos(0* - 0 O ) and d k cos(8 k - 0 O ), respectively, in (5.6-3) to obtain 
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“ a U) 


(b) 

Figure 5.6-2. MSK modulators, (a) Modulator using direct carriers and (b) using 
equivalent carriers. 

the MSK waveform. The modulator is called parallel because of the use 
of parallel data streams and two quadrature carriers. It is called type I 
following [9] to distinguish it from a second form (type II, discussed later) 
in which the half-cycles of cos(nt/2T b ) are rectified so that they do not 
alternate in sign. An equivalent modulator is shown in Fig. 5.6-2(b); it 
requires fewer product devices but requires two narrowband bandpass filters 
to select constant frequencies at the values indicated. 

Except for the slight difference in the carriers involved, the type I 
system is just an OQPSK system. The optimum receiver is, therefore, quite 
similar to the OQPSK receiver. It is shown in Fig. 5.6-4. The integrators 
are timed with the symbols of duration 2T b in each path. A sample is taken 
at the end of each interval to decide if the symbol was +1 or - 1 . The 
decision is in favor of + 1 if D, > 0, i = 1 or 2, and in favor of - 1 
otherwise. 











Figure 5.6-3. Timing diagrams for MSK modulators of Fig. 5.6-2. 


Figure 5.6-4. Receiver for MSK type I modulators of Fig. 5.6-2. 
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To determine the characteristics of the signal ,? MSK (/) generated as in 
Fig. 5.6-2, we consider the two time intervals 2 n - 1 and 2n centered at 
time t = 2 nT b , as shown in Fig. 5.6-3(a), (b), and (d). We readily write 


A£f 2 „_,cos( — jcos(m 0 / + 0 O ) 


- A</ 2 „_ 2 sin( — )sin(&> 0 / + 0„), (2 n - 1) T b *= / =s 2nT b 

wo-< b (5 

Ad in _ ,cos f j cos(a) 0 / + 0 O ) 

-Atf 2 „sin^~-jsin( £ e„/ + 0 O ), 2nT b « / (2« + 1)7;. 

By use of standard trigonometric identities, this equation reduces to 


COS (O 0 t + do - — 

V 27 V (2 n - 1 )T b =£ t 


, ,/“2«-l + ^2/1-2 1 l „ 77/ 

+ A \ 2 Jcos^wo/ + 0 O + — 


Jd 2 „-, - <*2n\ / . . „ rrt 

A v — 2 — j C0 V° r + - W b 


(5.6-7) 


+ a(- ^"-' 2 + ^" jcos(m 0 / + Oo + ^J, 2 nT b « / « (2« + 1)7*. 

Next, we recognize that (2« — 1)7), *£ / 2nT b corresponds to data interval 
k = 2n - 1, k odd. Similarly, 2nT b =£ / (2n + 1)7; corresponds to 

interval k = 2 n, k even. By writing (5.6-7) in terms of k, we have 


.(d k ~ d k - A / „ 77/ \ 

^ ^ )cos cu 0 / + 0 O - — I 

V 2 / V 27 V ft7; =s / *= (ft + i)T b 

Jd k +d k . ( 77/ * 0dd 

+ A( ' jcos(w 0 f + 0 O + 2 j) 


Jd k - „ 77/ 

2 Icosl OJ 0 / + 0 O — i^T 


,(d k + c/ t _A / 77/ 

+ ^ 2 )cos( co 0 / + 0 O + — 


(5.6-8) 


*/ l *r» « / * (ft + 1)7; 
\ (’ ft even. 
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A IT 

o> 0 + Aco - o> 0 + — 

when d k = d k _ , 

(5.6-10a) 

£0 0 - Aco = o> 0 - ~ 

1 b 

when d k f d k _ t . 

(5.6-10b) 


Although keying is between the desired frequencies, there is not a one-to- 
one relationship between d k and frequency deviations ±Acu. Deviation in 
interval ft depends not only on d k but on d k . t as well. Table 5.6-1 illustrates 
the waveforms that result from various combinations of d k and d k . x . By 
a slight modification the one-to-one relationship can be established. We 
next discuss this modification. 


TABLE 5.6-1. Waveforms Possible with Type I MSK. 


+ 

+ 



Waveform During Interval k 


k odd 

k even 

A cos (w„< + 0 O + Aw( + 0) 

A cos(iu 0 < + 0 O - Aa il + tt) 

A cos(gj a t + d 0 - Awr + 0) 

A cos(a> 0 7 + do 4- Aa>i + tt) 

A cos( + Aa>/ + 0) 

A cos(u 0 / + 0 O “ Acuf + 0) 

A cos (a> 0 / + 6 0 - Awf + tt) 

A cos(ttj 0 t + 6q + Aa)t + tt) 


Fast Frequency Shift Keying 

By differentially encoding the data waveform d(t) before it is applied 
to the type I MSK modulators of Fig. 5.6-2, the resulting MSK waveform 
will have a one-to-one correspondence between data waveform and transmitted 
frequency. The resulting system is often called fast frequency shift keying 
(FFSK) [9, 21], although we may still consider it a form of MSK. 

Differential encoding can be done on the data source’s binary digits 
using the encoder of Fig. 3.6-3(a) prior to polar-NRZ formatting. Decoding 
on the detected binary digits uses the decoder of (c). Alternatively, the 
encoder and decoder of Fig. 5.6-5 can be used directly with the polar- 









Figure 5.11-2. (a) Data waveform, (b) 
phases of direct and delayed FSK sig- 
nals to product device, (c) difference 
in phase between direct and delayed 
waveforms, and (d) detector response 
of the differential detector. 


where 


A 2 (l - K) 2 
4cr 2 (l + |p|) 
A 2 (l + Kf 
4cr 2 (l + |p|) 


(5.11-4) 

(5.11-5) 


(5.11-6) 
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Here p is the normalized autocorrelation function (autocorrelation function 
divided by a 2 ) and £>(•>•) is Marcum’s Q function defined by [41] 

Q(a,/3) = J~ xl 0 (ax)ex p - -y — - dx (5.11-7) 

Q(a, 0) = 1 (5.11-8) 

Q( 0 s/ 8) = exp(- (5.11-9) 

For sources of the tabulated Q function, the reader is referred to the 
references in [41]. 

If the shape of the predetection filter’s transfer function is chosen 
such that p = 0 when evaluated at delay r, (5.11-3) will reduce to 



(5.11-10) 


Because the form of (5.11-10) is the same as (5.10-27) for P e in DPSK, the 
potential exists for differential FM detection to perform as well as DPSK. 
Actual performance depends greatly on the selection of the predetection 
filter. 

If the predetection filter’s 3-dB bandwidth, denoted by B (Hz), is 
large, noise power a 2 is excessive and P e degrades. If B is too small, the 
signal waveform is distorted and intersymbol interference increases, which 
tends to degrade P e . Anderson, Bennett, Davy, and Salz, [47] studied an 
orthogonal FSK system (Acu = ir/T b ) that used a 4-bit delay and a cosine- 
shaped filter of bandwidth B = l/T b ; performance was degraded 4.8 dB 
relative to DPSK (which is about 5.7 dB relative to coherent MSK for P e 
near 10 -4 ).t Only 1.8 dB of this degradation was attributed to the de- 
modulation; 3 dB was considered to be the result of power in discrete 
spectral components that represent wasted power. Suzuki [48] studied an 
MSK system (1-bit delay) with a Gaussian shaped filter; he found that the 
optimum bandwidth B = L2l/T b gave a degradation minimum of 4.02 dB 
relative to coherent MSK. Crozier, Mazur, and Matyas [49] studied six 
filters: Gaussian, two- and four-pole Butterworth (both phase equalized and 
nonequalized), and ideal rectangular. They found degradation and optimum 
B in each case. Degradations ranged from 2.9 to 3.4 dB relative to coherent 
MSK; optimum B ranged from 1.0/7), to 1.2/7). The four-pole equalized 
filter gives the smallest degradation (2.9 dB) when its bandwidth is 1.1/7) 
[49], It was also found in [49] that degradation is minimum for large bandwidth 
lowpass filters following the detector. These studies tend to show that 
practical systems tend to give about 3 dB degradation relative to coherent 
MSK and the best predetection filter bandwidth is approximately 1.1/7). 




t Degradation is defined as the increase in signal-to-noise ratio required, relative to 
the reference system, to produce the same values of P, in the two systems. 
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5.12 NOISE PERFORMANCE COMPARISONS OF SYSTEMS 

To compare performance of various systems and to serve as a convenient 
reference, the various equations that give average bit error probability are 
summarized in Table 5.12-1. Entries are in order of decreasing performance 
(higher P e for e above about 5 dB). Equations are shown in terms of e, 
the average transmitted energy per bit divided by W 0 , because this parameter 
is a fair basis on which to compare systems. A few baseband system entries 
are also given for completeness. Plots of these expressions are shown in 
Fig. 5.12-1. 


TABLE 5.12-1. Average Bit Error Probabilities, of Various Systems. 


Equation 

Applicable Systems 

P ' = \ erfc[\/'e] 

PSK, QPSK, OQPSK, MSK, Polar-NRZ base- 
band, Manchester baseband 

P > = \ exp( — e) 

DPSK 

P, ~ ^ erfc 

Duobinary baseband, Modified duobinary 
baseband 

P. = ~ erfclVe/2] 
P, = “ exp( — e/2) 

ASK (coherent), FSK (coherent), CPFSK (or- 
thogonal), Unipolar-NRZ baseband 

FSK (noncoherent) 

p , ~ “fl + (2ne) ~ ,/2 ]exp( - e/2) 

ASK (noncoherent) 


Of the baseband systems, polar-NRZ and Manchester formats have 
a clear performance advantage over the duobinary formats, although these 
latter systems have a small advantage over the unipolar formatted system. 

Of the carrier modulated systems, the noncoherent forms (ASK, FSK) 
are only slightly poorer (by less than 1 dB) than their coherent versions 
when P e < 10 4 . Correspondingly, the DPSK system is less than 1 dB 
poorer than PSK under similar conditions. Such small losses in performance 
are often a minor price to pay for the implementation simplicity of the 
noncoherent systems, which explains, in part, why they enjoy such wide 
use. Of the noncoherent systems, the DPSK system clearly has superior 
performance compared to the noncoherent ASK and FSK systems; the 
performance advantage is 3 dB in e for a fixed value of P t . 
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5.13 SUMMARY AND DISCUSSION 

When some characteristic of a carrier signal, such as its amplitude, phase 
or frequency, is modulated by a digital message, the result is a bandpass 
waveform. This chapter examines a number of binary digital systems that 
transmit bandpass waveforms through the channel. These systems can be 
broadly classified as coherent, noncoherent, and semicoherent. 

A coherent system presumes knowledge of the transmitted carrier’s 
phase, timing of bit (symbol) intervals, and the forms (shapes) of the two 
possible transmitted waveforms. Waveform shape is presumed to be a priori 
knowledge. The first two quantities are usually determined by special circuitry 
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designed to observe the received waveform to obtain estimates of carrier 
phase and bit timing. These estimates essentially correspond to synchro- 
nization of the receiver with the transmitter. Synchronization is not a 
principal consideration in this chapter, and is assumed, for the most part, 
to exist without error. Within this constraint, the coherent binary system 
is found that maximizes the probability that a given symbol is correctly 
demodulated when the channel has white Gaussian noise and infinite band- 
width. This is defined as the optimum system and its structure is given in 
Fig. 5.1-1 for both correlator and matched filter forms. The average probability 
P e of a symbol error is given by (5.1-9). 

Special cases of the general coherent system are also developed. The 
amplitude shift keyed (ASK) system has the structure of Fig. 5.2-1 and its 
error probability is given by (5.2-5) when the two transmitted waveforms 
are equally probable. A system using phase shift keying (PSK) has the 
block diagram of Fig. 5.3-1 for equally probable messages; its error probability 
P e is given by (5.3-5). Quadrature PSK (or QPSK) is a variation of PSK 
that allows twice the data rate of PSK while requiring no additional channel 
bandwidth or average energy per bit. QPSK also gives the same bit error 
probability as PSK. Its modulator and receiver are given in Figs. 5.4-3 and 
5.4-4, respectively. Offset QPSK (or OQPSK) is a minor variation of QPSK 
that is slightly better than QPSK in some practical applications. 

Another special case of the general coherent binary system is frequency 
shift keying (FSK), where carrier frequency is modulated. Two versions 
of FSK are described for the case of equally probable messages. If keying 
is between independent oscillator frequencies at the modulator, the system 
of Fig. 5.5-1 applies; if the frequencies are chosen so that the transmitted 
waveforms are orthogonal (y = 0), P r is given by (5.5-11). A second version 
of FSK follows the use of a voltage-controlled oscillator modulated by the 
binary message; this form of modulation is called continuous phase FSK 
(CPFSK). The optimum receiver is again given in Fig. 5.5-1. Bit error 
probability in CPFSK is given by (5.5-22). The smallest frequency deviation 
Aw that makes the correlation between the transmitted waveforms in CPFSK 
zero (y = 0) defines minimum shift keying (MSK). 

Other possible implementations of MSK are next described. Some 
implementations used parallel forms for the modulator (Fig. 5.6-2) and 
receiver (Fig. 5.6-4) and are called type I. When the data stream at the 
modulator is precoded, the type I MSK system is called fast FSK (FFSK). 
A slight variation of the type I system leads to what is known as type II 
MSK (Fig. 5.6-6). Still another form of MSK is called serial (Fig. 5.6-7). 
Serial MSK has a simpler form than the parallel types and is preferred in 
some high data-rate systems. Average bit error probability, as given by 
(5.6-40) for equally probable messages, is the same for both the parallel 
and serial types of MSK. 


Chap. 5 References 321 

The chapter next turns to noncoherent bandpass systems where the 
receiver has no knowledge of carrier phase. Based on maximizing average 
probability of making correct bit decisions, the general optimum receiver 
structure is found to be nonlinear. Both correlator (Fig. 5.7-2) and matched 
filter forms (Fig. 5.7-4) are possible. Special cases considered are noncoherent 
ASK (Fig. 5.8-1) and noncoherent FSK (Fig. 5.9-1). Average bit error 
probability is given by (5.8-17) for noncoherent ASK and by (5.9-8) for 
noncoherent FSK, both for equally probable messages. 

No truly noncoherent form of PSK is possible. However, differential 
PSK (DPSK), a sort of semicoherent system, requires no carrier phase 
knowledge. The system (Fig. 5.10-1) uses the received waveform as its 
own reference. P e for this system is given by (5.10-27). 

Differential demodulation can also be applied to FSK and MSK wave- 
forms, as discussed in Sec. 5.11. Performance (P e ) for this type of demodulator 
is difficult to determine because it is heavily dependent on the form of the 
predetection filter that precedes the differential detector. 

The chapter ends by grouping the various equations defining P e for 
both bandpass and baseband systems (Table 5.12-1). Curves (Fig. 5.12-1) 
generally show that coherent systems outperform their respective noncoherent 
(or semicoherent) counterparts, although by less than 1 dB for P e < 10“ 4 . 
Of five baseband systems compared, those using polar or Manchester wave- 
form formats perform the same and are the most superior (equal also to 
the best coherent bandpass systems). A system with the unipolar format 
performs the poorest of the five formats. Duobinary and modified duobinary 
systems perform equally well and fall in between the other baseband cases, 
being less than 1 dB superior to the unipolar system. 
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PROBLEMS 

5-1. The two waveforms transmitted in a binary digital system are 
s 2 (t) = A sinj^jrectp — ^^jcoslaj,,; + 0 2 ) 

sft) = A sin^^jrectp — p^Jcos(a) 0 f + 6,), 

where 6, and 8 2 are arbitrary phase angles, (a) Find expressions for the 
energies in these signals, (b) For a specified value of T b , what values of 
o> 0 are allowed if the two energies are to be independent of ai 0 , 6 ,, and 0 2 ? 
(c) If <x) 0 T b » 4 tt, show that E t = £ 2 = A 2 T b /4. 

5-2. In an optimum coherent binary digital system, it is desired that the optimum 
threshold be zero. Show that this condition is equivalent to energies £, and 
Ei, probabilities P, and P 2 , and jV 0 satisfying 

£ 2 i „ r -Ei 

P 2 exp —— = P,exp — — . 

_ *T0 J 

5-3. For T b specified, determine what values of cu 0 will give zero normalized correlation 
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between the waveforms of Prob. 5-1. (Hint: Calculate the integral required 
to find y first.) If a> 0 » lir/T b , what does y become? 

5-4. In an optimum coherent binary system transmitted waveform energies per bit 
are £, = 5(10 4 ) J and £ 2 = 20(10" 4 ) J while y = -0.3. (a) If channel noise 
level is determined by Jf 0 = 1.6012(10“ 4 ) W/Hz, what is P, for this system? 
(b) If jV 0 is reduced to 66.39% of its value in (a) by improved receiver 
design, what does P, become? By what factor has P, improved? Assume 
P\ = Pi = J in every case. 

5-5. If £i = E 2 and y = — 1 in an optimum coherent binary system what minimum 
value of the ratio of average energy transmitted per bit to Jf 0 is required to 
give P, = 3.605(10 -3 )? 

5-6. In an ASK system (OOK variety), the probability of a pulse is P 2 and that 
of no pulse is 1 - P 2 . Plot curves for e = 2.5, 5.0, and 10.0 to show how 
the optimum normalized threshold V T /Jf 0 of the receiver deviates, as a ratio, 
from its value when P 2 = 1 - P 2 . That is, plot [(V T /JfJ - e]/e for various 

Pi- 

5-7. Use (5.2-4) to show that the responses of the receiver of Fig. 5.2-l(c) to the 
signals of (5.2-1) are identical to those of the receiver of (b) if all responses 
are evaluated at the sample time t - T b . 

5-8. An optimum ASK system, for which P, = P 2 and P, = 5.715(10“ 4 ), uses 
pulses with peak amplitude 0.2 V. On the channel = 10~ 5 W/Hz. 
(a) What peak signal-to-noise ratio occurs at the sampler at the sample time? (b) 
What is T b I 

5-9. An optimum coherent ASK system produces a peak signal-to-noise ratio of 
12.5 (or 10.97 dB). What is P r if P, = P 2 = J? 

5-10. By integration of (5.2-11) find the power in (a) the discrete spectral components, 
and (b) the remaining terms, (c) What should the total average power be? 
Does the sum of powers in (a) and (b) equal what it should? 

5-11. An ASK signal does not go to full off when a code 0 is transmitted. If the 
two peak carrier levels are A t and A 0 , obtain an expression for the transmitted 
signal’s power density spectrum. Assume a> 0 T b is an integral multiple of tt 
if needed. 

5-12. In the ASK system of Prob. 5-11, assume message probabilities are P, = 
Pi = i. A, = 3 V, A 0 = 1 V, T b = 1 ms, and = 8.681(1<T 5 ) W/Hz. 
(a) Find the optimum threshold voltage, (b) Find bit error probability P r . 

5-13. A coherent ASK system (OOK variety) transmits with a peak voltage of 5 V 
over a channel having an unknown loss. If A' 0 = 6.0(10” ls ) W/Hz, bit 
duration is 0.5 jus and the system performs with P, = 1 0 4 , what is the loss? 
Assume message probabilities are £, = P 2 = h 

5-14. Work Prob. 5-13 except for a channel having P, = 2.035(10* 4 ). 

5-15. In a PSK system co 0 is an integral multiple of 2Tr/T b . Ifs 2 (t) = A sin(7r//7' f ,)sin(<uo'' 

+ 0 O ) for 0 « t T h and zero elsewhere and s,(f) = -s 2 (t), sketch the block 
diagram of a correlation-type optimum receiver (for white noise). Find the 
possible signal components of output voltage at the sampler at the sample 
time. Assume £, = £, = i 
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5-16. Show that the system of Fig. 5.3-1 remains optimum even if o> 0 is a multiple 
of half the bit rate a> b if the local carrier is phase coherent with the transmitted 
carrier. 

5-17. Find the transfer function of a matched filter that can replace the integrator 
in Fig. 5.3-1 without loss of performance. Prove that the filter produces the 
same signal responses and noise power at the sample time as the integrator. 

5-18. PSK and coherent ASK systems use the same peak pulse amplitudes and 
have the same values of P, and jV 0 . How are the bit durations T b related? 

5-19. In a PSK system P t = 10' 7 , A = 10 V, P, = P 2 = i, = 3.69(10' 7 ) 
W/Hz and the bit stream is the result of time multiplexing bits from N binary 
PCM encoded sources. What is N if the original analog messages (all similar) 
had spectral extent W f = tt( 10 4 ) rad/s and were sampled at the Nyquist rate 
with 5-bit PCM encoding? 

5-20. Find P e for a PSK system having e = 10. If local carrier phase causes the 
signal levels at the sampler to decrease by 1.5 dB, to what value will P , 
change? 

5-21. For the PSK system of Prob. 5-19 what is the peak signal-to-noise power 
ratio that occurs at the sampler at the sample time? 

★5-22. As indicated in the text, show that two coherent PSK systems that use the 
same carrier but in phase-quadrature — that is, one is cos (<a 0 f + So) while 
the other is sin(<a 0 r + 6 a )— operate independently if oi 0 T b = mir, m a positive 
integer. The proof must show (a) that the output signal of either system is 
not dependent on the other at the sample times and (b) that noises out of 
the two systems are independent at the sample times. 

5-23. Two separate digital messages each with bit rat e/* = 1/7), = 10 4 bits/s are 
conveyed to a receiver by QPSK over a channel for which = 9.645(10"’) 
W/Hz. If the composite QPSK signal’s peak amplitude is 2.5\/2 V, what 
is the probability that either or both of the messages will have a bit error 
occur in a given bit interval? 

5-24. (a) Sketch the signal constellation for a QAM waveform using polar waveforms 
of ±2 V (waveform 1) and ±3 V (waveform 2). (b) Demonstrate that the 
QAM signal has constant amplitude in every bit interval, (c) What are the 
four phases possible in the QAM signal? What is its amplitude? 

5-25. (a) Sketch the three polar waveforms applicable to a OQPSK modulator for 
a source data sequence 0101110010010. (b) Sketch the carrier phase values 
with time for this sequence. 

5-26. Assume a QPSK signal and no noise are applied to the network of Fig. 5.4- 
5(b) modified to use a fourth-power nonlinearity, filter at 4<u 0 and -t-4 divider. 
Find the divider’s output signal. What phase shifts are now needed? 

★5-27. Represent a QPSK input signal to the Costas loop of Fig. 5.4-6 by c u cos(co 0 f 
+ 0o) ~ c 2 jSin((u 0 t + 0 o ), where c u = +A in symbol interval k according 
to odd-numbered bits in a data message stream and c u = +A according to 
the even-numbered data bits. Let the VCO output be sin(o> 0 f + 0 O + <W> 
where <f>, is a small phase error, (a) Find the VCO’s input control voltage 
and show that it tends to force <f>, to zero, (b) Add a phase shift (ambiguity) 
e m = mull, m = 1, 2, 3, 4, to the input QPSK signal and show that the 
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same control voltage is obtained as in part (a). Assume no noise in both 
(a) and (b). 

5-28. Work Prob. 5-13 for a coherent FSK system that uses independently switched 
oscillators. 

5-29. A nonorthogonal CPFSK system has e = 9.68, \a>T b = 2.0 and transmits 
bits of 100 jus duration on a high-frequency carrier where (hJItt » 1/7). 
In the receiver the average noise power at the sampler is known to be 
ai = 1 .92(10* 7 ) W. Find: (a) y, (b) (c) A, and (d) P r for this system. 

5-30. Find an equation for the power spectrum of the transmitted FSK waveform 
in Prob. 5-29. Plot the spectrum normalized to (A 2 T b /2) versus the variable 
-0.7 =£ (oj - oj 0 )/<u t ss 0.7. Compare your results to those in Fig. 5.5-3. 

5-31. Amplitudes for the first 16 bit intervals of a data waveform used in CPFSK 
are defined by {1, -1, -1, 1, -1, 1, 1, 1, -1, 1, -1, 1, 1, -l, -l, - 1}. 
If A <0 = tr/47), for this system, construct a phase trellis showing the history 
of phase due to modulation. Assume phase is zero at / = 0 and note this 
is not an MSK system. 

5-32. Find expressions for the carriers that form inputs to the products in Fig. 
5.6-2(b) that have other inputs d,(i) and d Q (t). Does your work verify that 
the two networks of (a) and (b) are equivalent? 

5-33. Assume a rectangular symbol pulse rect[(/ - T b )/2T b ] is applied to a passive 
pulse shaping network as in Fig. 5.6-6(a). If the desired response must be 
cos[ir(/ - 7))/27],]rect[(f - T b )/2T b ], show that the required network is 
unrealizable. 

5-34. Begin with (5.6-13) and use (5.6-11) to show that type II MSK with differential 
encoding of data does not give a one-to-one correspondence of FM with 
data. Give a table similar to Table 5.6-2 to support your conclusion. 

5-35. Show that (5.6-17) and (5.6-16) are equivalent. 

★5-36. By using the power spectrum of (5.6-32) show that the total power in an 
MSK signal is A 2 / 2. (Hint: Use Parseval’s theorem.) 

5-37. A serial MSK system transmits a signal with 1.2-V peak amplitude and 9(10" 4 ) 
J/bit energy over a channel defined by jV 0 = 1.8(10' J ) W/Hz. Find: (a) the 
peak signal power to average noise power ratio at the sampler at the sample 
time, (b) average bit error probability P,, and (c) the duration of a bit. 

5-38. Show that (5.7-24) can be restated in the form 



Note that in an optimum coherent system, this condition is equivalent to 
having a zero threshold V T [see (5.1-3)]. 

5-39. Show that (5.8-9) is true. That is, show that the matched filter with transfer 
function H 2 ((u) found in Example 5.8-1 produces a peak envelope vv = 
E 2 = A 2 TJ2 at its output when its input is a typical ASK pulse. 

★5-40. Figure 5.8-2 applies to noncoherent ASK system output voltages from the 
envelope detector at the sample times. Densities are given by (5.8-12) and 
(5.8-13) and is given by (5.8-9). (a) As a second derivation of ASK 
results, set up the error probability integrals assuming V T is arbitrary. By 
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differentiation show that the optimum V T occurs at the crossover of the 
densities when data bits are equally probable, (b) From your result in (a) 
use (5.8-9) and (5.8-10) to show it to be equivalent to (5.8-6). 

★ 5 - 41 . Carry out the steps indicated in the text to obtain (5.8-17). 

5 - 42 . Use (5.8-16) to obtain an approximate expression for P, in a coherent ASK 
system. Relate the coherent and noncoherent ASK systems to each other 
using (5.8-17) and the approximate expression. If e = 23.8 in both systems, 
what are the two error probabilities? Check your values with Fig. 5.12-1. 

5 - 43 . Assume both 2w 0 T b and 2AioT b are integral multiples of n and show that the 
noises at the matched filter outputs in Fig. 5.9-1 are statistically independent, 
if taken at the same time. Assume white, zero-mean Gaussian channel noise. 

5 - 44 . Consider P, for noncoherent FSK as given by (5.9-8) and (5.5-1 1) for coherent 
FSK. (a) Use the approximation of (5.8-16) and derive the relationship 
between e in coherent FSK (denote by e c ) and e in the noncoherent system 
(denote by e„) that must hold if both systems give the same values of P, . 
(b) Show that e„/b c -* 1 as e c — > oo — that is, both systems require the same 
e and neither has any advantage over the other for large e. 

5 - 45 . Work Prob. 5-13, except assume a noncoherent FSK system. 

5-46. In a certain noncoherent FSK system e = 20.0. If the channel suddenly 
increases 1 dB in attenuation, by what factor will P, increase? Find the two 
values of P r . 

5 - 47 . Show that the DPSK signal of (5.10-3) causes the response of (5.10-6) at the 
output of the receiver’s matched filter defined by (5.10-4) and (5.10-5). 

5 - 48 . Use (5.10-6) to obtain expressions for the direct and 1-bit delayed signals at 
the DPSK receiver’s product device in Fig. 5.10-1 that are applicable in bit 
interval k— that is, for kT b *& t (k + 1)1*. For t near ( k + 1) 7*, do 
your results yield (5.10-14) and (5.10-15)? 

★ 5 - 49 . Justify that the powers in the noises ft, ft, and a, and the noise part of a,, 
as given by (5.10-20), are all the same and equal to NJA, where N , is the 
DPSK system noise power at the matched filter’s output. 

5-50. A DPSK system uses the same energy per bit and the same channel (same 
A'o) as a PSK system for which e = 5.1. Find P r in the two systems. Do 
your values agree with results from Fig. 5.12-1? 

5 - 51 . An FSK signal having A<u = 2ir/2T b is to be differentially detected, (a) Find 
the required detector delay r. (b) Sketch waveforms analogous to those in 
Fig. 5.11-2 if the data sequence is {-1, 1, -1, -1, 1, -1, 1, l, l, - 1} for 
the first 10 bit intervals past t = 0. 


★Chapter 6 

M-ary Digital 
Systems 


6.0 INTRODUCTION 

Except for a small number of digressions (M-ary PAM in Chap. 4; QAM 
and QPSK in Chap. 5), our preceding work on digital systems has concentrated 
on binary message sources. There are, however, a number of current-day 
applications that involve nonbinary systems. We call these M-ary systems, 
which refers to the fact that the transmission interval can now contain any 
one of M possible waveforms as opposed to only two in the binary system. 
The transmission interval is called a symbol interval (bit interval in the binary 
case), and its duration will be labeled T (instead of 7), in the binary system). 

To define the M-ary system more carefully, we consider Fig. 6.0-1, 
which is an extension of the binary system of Fig. 4.1-1. The message 
source can now generate any one of M messages in the set {m,} during 
any symbol interval. To each message the modulator assigns one waveform 
of a set of M waveforms {r,(/)} for transmission over the channel. Thus 
each waveform is uniquely assigned to one message; its probability of being 
transmitted is equal to the message’s probability of occurrence, denoted 
by 

P t = P(m,), i = 1,2 , ..., M. (6.0-1) 

These signals are assumed to be nonzero only in the symbol intervals in 
which they are transmitted, so they have duration T. 

The waveform at the receiver, denoted generally by r{t), consists of 
the transmitted signal corrupted by broadband channel noise that we model 


329 


White Noise 


Sec. 6.1 Vector Representation of Signals 


331 



as Gaussian white noise njt). The power density of njt) is denoted as 
-V|)/2, — oo < c < co. Hence 

r(t) = r,(f) = s,(t) + njt), i = 1,2, .... M. (6.0-2) 

We seek the receiver that is optimum under certain constraints. For 
example, we assume that it observes r(t) during each symbol interval and 
then decides which of the messages m,, i = 1, 2, ..., M, was most probably 
sent. The system is constrained, therefore, to make symbol-by-symbol 
decisions. The overall system output, represented by m, consists of one 
of the M possible messages in each interval. 

The optimization criterion that we use is similar to those used for 
binary systems in Chaps. 4 and 5. That is, the receiver makes the optimum 
decision by choosing, in each interval, that message with the largest probability 
of having been sent, given that the observed waveform r(t) occurred. The 
analysis procedure will again depend on representing the received signal 
and noise waveforms by a suitable set of random variables. In the binary 
analyses these random variables were generated by samples taken by the 
receiver over the bit interval. Although the sampling technique was simple 
and direct, we shall adopt a second, more elegant and powerful, approach 
in this chapter. The approach is based on representing both signals and 
noise by a vector (geometric) model. In the following sections we first 
establish these vector representations and then develop the optimum systems. 
Our work is based heavily on the pioneering works of Rice [1] on noise 
theory, and KoteTnikov [2] on waveform and system modeling, as well as 
other literature giving refinements and extensions of the earlier theory, such 
as Golomb [3], Wozencraft and Jacobs [4], Viterbi [5], Sakrison [6], and 
Lindsey and Simon [7], to name just a few. 


★6.1 VECTOR REPRESENTATION OF SIGNALS 
Orthogonal Functions 

Let {<b„(t)} be a set of possibly complex functions of time with the 
property that 

( 6 . 1 . 1 ) 

where t 2 > t x and X is a real constant (the energy in 4>„(t) on the interval 
[<i, 12 ])- The functions are called orthogonal on the interval [/,, r 2 ] due 
to their behavior in (6.1-1). They are said to be orthonormal if X = 1 (unit 
energy waveforms). 

To illustrate that orthogonal functions are not new to the reader we 
develop a simple example. 
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Example 6.1-1 

Consider the set of cosines {cos(n2trt/T)}, n = 1,2, .... where T is the period of 
the lowest-frequency waveform. Let /, = 0 and f 2 = T in (6.1-1). Then 


s: 


cos I 


( nlrrt 
T 


cos ■ 


/ m2TTt\ 


\ T 


dt = 


if 

2 Jo 
+ cos 


cos 


(n + m)2TTt 

y 


(n - m)2rrt 1 

. T JJ 


dt 


sin[(n + m) 2tr] sin[(« - m)27r] 

2(n + m)2Tt/T + 2 (n - m)2ir/T 


' 0 , 

T 

X 


n f m 


and we find that the cosines are orthogonal, t By dividing by \jT/2, the set 
{\/2jf cos( n2iTt/T)} contains orthonormal cosines. 

By repeating the above procedure the waveforms {\/2/f sin(n2irt/T)}, n = 
1, 2, ..., are also found to be orthonormal on [0, T). 


Vector Signals and Signal Space 

One of the most important uses of orthogonal functions is in representing 
waveforms. The reader is already familiar with two such representations. 
The sine and cosine functions of Example 6.1-1 may be used to define a 
periodic function through the Fourier series of (A.4-2). (Also see Prob. 
6-2.) In a similar manner the orthonormal exponentials can be used in the 
complex Fourier series (A. 4-6) to develop a second representation of an 
arbitrary periodic waveform (Prob. 6-3). 

Of more interest to the Af-ary system of Fig. 6.0-1 is an orthogonal 
function representation of the transmitted signals WO- We summarize the 
important ideas in a theorem and then illustrate the theorem’s proof: 

Theorem. Let WO, * = 1, 2, ..., M be Af (possibly complex) signals 
that are arbitrary except they are all nonzero only for 0 t *£ T and have 
finite energy 

E, = f MOf dt <oo, i= 1, 2, (6.1-2) 

■Jo 

Then a set of TV =s Af orthonormal “basis” functions 4> n (t), n = 1,2, ..., 


+ From L’Hospital’s rule, lim Ar _, sin(x) = 1, which is needed for the case m = n. 

x ~*0 


Sec. 6.1 Vector Representation of Signals 333 

N, that are nonzero only for 0 I can always be found such that 

N 

Si(t) = 2 S in4>n(t), i = 1,2, ...,M (6.1-3) 

1 

where 

s ln = f Si(t)<ti(t) dt, (6.1-4) 

-'o 

for i = 1, 2, ..., M and rt = 1, 2, ..., N. 

Proof of the above theorem consists of giving a procedure, called the 
Gram-Schmidt procedure, that will always yield the set {<£„(?)}. First, how- 
ever, we shall introduce the important concept of representing a waveform 
by a suitably defined vector. 

Observe from (6.1-3) that once the functions ) have been determined 
that apply to all M waveforms «,(()> each waveform is completely defined 
by its coefficients s in . We visualize these coefficients as components of an 
TV-dimensional geometric space we call the signal space that has TV mutually 
orthogonal axes. Each axis, or coordinate, is associated with one of the 
orthonormal functions. From this point of view the M signals s,(t) are 
defined by M points in the TV-dimensional signal space through their vectors 
(j M , s, 2 , ..., s iN ), i = 1, 2, ..., Af; vector component (amplitude) s n is 
associated with coordinate <f>,, component s i2 with coordinate <f> 2 , and so 
on, for signal i. An example will help make these concepts more concrete. 


Example 6.1-2 

Consider M = 4 signals, the first of which is ri(f) = A cos(o» 0 / + 0 0 ) + A 
sin(c o„t + d 0 ), where co 0 is a positive integer multiple of 2ir/T\ A and 0 O are constants. 
As in Example 6.1-1, these two sinusoids can be shown to be orthonormal if we 
define 


d>, (0 = ^-cos(o) 0 r + 0 O ), 


WO 




sin(oi 0 r + do). 


0 « t =£ T 


O^t^T. 


We use these results to rewrite s,(t) and define other waveforms as follows: 


WO = J^<t>iU) + yf WO 

WO = -A cos(o> 0 t + $o) + A sin(o> 0 f + 8 0 ) 

= + yf wo 

WO = -A cos(o) 0 r + do) - A sin(co 0 r + 6 0 ) 

--Vfwo-Vfwo 
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£»(0 - A cos(co 0 t + d 0 ) -A sin(<u 0 r + d 0 ) 

alljor 0 « t *= J, where E = A 2 T. Signal j,(/) is defined by its vector (\/E/2, 
\/E/2) in N = 2-dimensional signal space having coordinates </>, and <t> 2 . Others 
are similarly defined. These vectors are illustrated in Fig. 6.1-1. We call the array 
of signal vectors a signal constellation. All vectors have the same length (called 
the norm), which means all four signals S/(t) have the same energy per symbol. 



The signals above can also be written as 

*/(0 = \f2 A cos a ) 0 t + 0 O - - 2 ' , i=l, 2, 3, 4, 

which shows that they collectively form a 4-ary (4-phase) PSK signal. Since this 
same signal constellation was observed in QPSK (Fig. 5.4-1), we conclude that 
QPSK is the same as 4-ary PSK. 

Gram-Schmidt Procedure 


Proof of the theorem involving the signal representation of (6.1-3) 
consists of giving a procedure whereby the orthonormal functions <t> n (t) are 
found. The procedure is outlined in the following steps [4, 6, 8, 9] called 
the Gram-Schmidt procedure. 


Step 1. Order (number) signals so 5 ,(r) i= O.t 
Find the energy in s,(r): 



t It is possible that a desired transmission could be no pulse. 


(6.1-5) 
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Normalize 5i(t) and let the result be the first orthonormal function 

*.(*): 


Clearly, 


so that 


ut) = lvi:r (0 - 


s,(t) = V£, ut) = sM), 


s,, = V^i- 


Step 2. Find the projection of s 2 (t) into coordinate <j>, by 
■* 2 i= r s 2 (t)<l)f(t) dt. 


( 6 . 1 - 6 ) 


(6.1-7) 


( 6 . 1 - 8 ) 


(6.1-9) 


Subtract the component of s 2 (t) that is in the 4> x direction; call the 
remainder an auxiliary function 6 2 (t) for this second step: 

Ht) = s 2 (t) - s 21 0,(/). (6.1-10) 

If 0 lit) — 0, disregard s 2 (t), renumber remaining signals and repeat 
this step for a new s 2 (t) so that 0 2 (t) / 0. Compute the energy in 
0 2 (t). The energy becomes the square of the component of s 2 (t) in 
direction 4> 2 . Thus 


&02 — f l®j(f)| 

-'o 


Next, set 


Hence 


\ 2 dt = s\ 2 . 


MO = (1 /VSfcWO. 


( 6 . 1 - 11 ) 


( 6 . 1 - 12 ) 


s 2 (t) - 0 2 {t) + s 2i <f>i(t) 

= VE^MO + s 2l d> t (0 (6.1-13) 

= s 22 (f) 2 (0 + S 2 l<l>l (0 
from (6.1-10) and (6.1-12). 

Step 3. Find the projections of s 3 (/) in coordinates <f>, and <j> 2 : 


= f s } (t)4>K0dt 

-'o 

= r siuwiodt. 

-'o 


(6.1-14) 


(6.1-15) 


Subtract projections from s 3 (t) to obtain the auxiliary function 6 } {t): 
6 3 (0 = s 3 (t) - s 32 </> 2 ( t ) - Sj^jU). (6.1-16) 
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If 0 3 (f) = 0, disregard s 3 (l) and renumber remaining signals so 0 3 (f) 
0. Compute energy in 0 3 (t) and normalize to obtain fft): 

E e , = f Htfdt = 4 (6.1-17) 

-'O 

Ut) = (1 /V£*> } (0. (6.1-18) 

Clearly, 

s 3 (t) = s 33 </> 3 (f) + s 32 </> 2 (0 + s 3 i^>i(t). (6.1-19) 

Step 4. Continue by repeating the procedure of Step 3 until no auxiliary 
functions can be found that are nonzero. There will be N « M functions 
If the M signals s,(t) are not linearly independent, it will result 
that N < M. If all are linearly independent, then N = M. This is 
the end of the procedure. 

We call N the dimension of the signal space. The set {</>„(/)}, « = 
1, 2, ..., N is said to be a basis function set that spans the ^/-dimensional 
signal space. 

Signal Energy and Average Power 

Energy, denoted by E h in signal s,(t) is related to its vector representation 
(6.1-3) as follows 

J '-T fT N N 

k(0| 2 * = I 2 2 S *m<i>m(t)dt 

0 J 0 „=1 m-1 (6.1-20) 

N N rT N 

= 2 E s iA I <t>f t)4>*u) dt = 2 Ml 2 - 

I m = 1 n = 1 

Average power P, on [0, T] is energy divided by T, or 

Ml 2 . ( = 1, 2, M. (6.1-21) 

1 n-l 

★6.2 VECTOR REPRESENTATION OF NOISE 

After having introduced the vector representation of signals, we next extend 
the concepts to include noise so that they may be used in the analysis of 
the system of Fig. 6.0-1. 

Basic Theorem. Let X(t) be a zero-mean stationary, possibly complex, 
random processt with a continuous autocorrelation function /?*(•), and let 


t The developments can also be extended to include nonstationary processes [10]. 
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{t/f„(/)} be a set of functions orthonormal on the interval t, « / =£ t 2 that 
satisfy the integral equation 

f R^t - 0W0 d{ = |o-„ !>„(/), t, « t t 2 (6.2-1) 


where <x„ is a constant, then X( t) has the representation (called the Karhunen- 
Loeve expansion) 


X(t) = 2 x n ut) 


( 6 . 2 - 2 ) 


where the coefficients X n are random variables given by 


xumodt. 


(6.2-3) 


The random variables X n have the property 

m;xj - " t " <«•«) 

which means they are uncorrelated^ The equality in (6.2-2) is taken in the 
sense of zero mean-squared error e 2 t 


= lim E 

N->°° L 


X(t) - 2 XMt) = o. 


(6.2-5) 


We shall not give a proof of this theorem. A proof is available in [10] 
(see also Probs. 6-13-6-15). Rather, we shall apply the theorem to our 
specific problem. 

White Noise Case 

For a white noise random process N(t) of power density X 0 /2 we 


( 6 . 2 - 6 ) 


so (6.2-1) with ti = 0 and t 2 = T becomes 


R.wit - OUi) dt 


{ -f)Ht - tmodt 

\ 2 / (6.2-7) 

<Pn(0 = M|M(f). 


t The set {</«„(/)} is said to be complete if e 2 -» 0 as N -* 

t We justify the need for a continuous noise autocorrelation function by treating white 
noise as the limit as bandwidth becomes infinite of bandlimited noise of constant power density 
-V„/2 as shown in Fig. 4.1-2. 



338 


339 


M - ary Digital Systems Chap. 6 

Thus we find that any complete orthonormal set will satisfy (6.2-7) and 

k„| 2 = f . (6.2-8) 

When these results are combined with (6.2-2)-(6.2-4), we have a representation 
of white noise N(t) 


MO - 2 MA(0 

N, 


f Numodt 

-'0 


valid on [0, T], where 

( 0, n i= m 

J{ 0 

T- n = w ' 


(6.2-9) 

( 6 . 2 - 10 ) 


( 6 . 2 - 11 ) 


★6.3 OPTIMIZATION OF THE M-ARY DIGITAL SYSTEM 

We are now in a position to develop the optimum M - ary system of Fig. 
6.0-1 based on the preceding vector modeling of signals and noise. 

Signal and Noise Vectors 

The transmitted (received) signals {s,(t)} have been represented by the 
orthonormal representation of (6.1-3), where the set of orthonormal functions 
{<£„(*)} was found by the Gram-Schmidt procedure. White noise has the 
representation (6.2-9), where the functions t|/„(f), n — 1,2, ...» can be any 
complete set of orthonormal functions. For analysis purposes we desire 
both signals and noise to use the same orthonormal functions. The conversion 
is readily accomplished by first arranging signals and noise functions <|/„( t ) 
in a sequence [6, p. 236]: 

*,(/), s 2 (t), s M (t), t,(l), HO, ••• • (6.3-D 

We then use the Gram-Schmidt procedure to form a new set of orthonor- 
mal functions based on the sequence. Clearly, the first N functions will 
be those found before for the signals. They were labeled HO, « = 1> 
2, .... N. The set now extends beyond N due to the signals ip n (t). 

Henceforth in this chapter we shall assume all signals and noises are 
real functions. From our procedure above, the signals become 

s,(0 = 2 s.jHO, i=\,2,...,M, (6.3-2) 

j - 1 
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where 

s u = f s,(OHO dt, i = 1, 2, .... M. 

-'O 

White Gaussian noise is represented by 


(6.3-3) 


n„{t) = 2 n jHO, (6.3-4) 

j - 1 

where the n s are zero-mean statistically independent Gaussian! random 
variables: 



njt)4>j(0 dt 

(6.3-5) 

j f m 

(6.3-6) 

, j = m. 



Noise terms in (6.3-4) for j > N exist in coordinates orthogonal to all 
coordinates for j *£ N. These terms therefore have no affect on the coordinates 
in which the signals are defined, and it is known [4] that the terms are 
irrelevant to the optimum receiver. On dropping the irrelevant noise, we 
denote the relevant noise by n(0- 

N 

n(0 = 2 n jHO- (6.3-7) 

j - 1 


In general, the statistically optimum receiver can observe only the 
received waveform in deciding which message was transmitted in a symbol 
interval. It is, however, reasonable to assume it has symbol synchronization 
as well as knowledge of the shapes of the possible transmitted waveforms 
and of message probabilities P(m,) = P t . Since r(t) can be represented 
by its vector random variable (r, , r 2 , ..., r N ) according to 

r(0 = 2 r jHO (6.3-8) 

j-i 

T 

r(t)HO dt, j = 1, 2, .... N, (6.3-9) 

its statistical properties are determined by this vector. Now let p(t) be a 
particular value of r(t). Since r(t) = s,(t) + n(t) in general, we have, in 



t Since n w (t) is assumed Gaussian, the coefficients n, must be Gaussian. Because the 
tij are uncorrelated and Gaussian, they must also be statistically independent. 
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particulart 

N 

pit) = S/(f) + n(l) = 2 Pj<t>jit), 
j - 1 


(6.3-10) 


where 

Pi = / pitWjit) dt = s v + nj. (6.3-11) 

-'o 

Decision Rule 

We call the receiver optimum if it chooses as the transmitted message 
that which has the largest probability of being sent, given that p(/) is 
observed in the symbol interval. Thus the optimum receiver sets the message 
estimate to m = m k if, for i = 1, 2, ..., M, 

P(>rik\n = p,, ..., r N = p N ) > P(m , ,| r, = p,, ..., r N = p N ), i ± k. 

(6.3-12) 

Equation (6.3-12) defines the optimum receiver’s decision rule. The prob- 
abilities indicated are called a posteriori because they relate to message 
probabilities after the fact, that is, after a specific waveform has been 
received. The receiver based on (6.3-12) is, therefore, called a maximum 
a posteriori (MAP) receiver. 

By use of Bayes’ theorem (Prob. 6-16), it can be shown that 


P( m i | r i = p, , ..., r N = p N ) = 


Prir, = p„ ...,r N = p N \ m,)Pi 
Pri r 1 = Pi. •••> C N = Pn ) 


(6.3-13) 

where p r (r,, ..., r N \ m t ) is the conditional joint probability density of the 
random variables defining r(t), and p r (r u ..., r N ) is the density without 
condition. On substituting (6.3-13) into (6.3-12), the MAP decision rule 
becomes 


Prin = f/v = Pn I m k )P k > p r {r , = p,, ..., r N = p N | m,)P,, i ± k. 

(6.3-14) 

A receiver that uses (6.3-14) for a decision rule, except without regard 
for probabilities P h is called a maximum likelihood receiver [4]. When all 
messages are equally probable, the maximum likelihood and MAP receivers 
are the same. 

In our system noises n } are Gaussian and statistically independent. 
Thus using (6.3-6) and (6.3-11), we have 

f 1 N 

P rip Pn I m,) = (ttvVo) W2 exp j 2 iPj ~ s u ) 2 



t We think of p, as specific values of the random variables r k in (6.3-8). 


The decision rule then reduces to choosing m = m k if 

\ N i N 

In (P t ) ~ T I ( Pi - 5 W ) 2 > ln(P f ) - — 2 {pj - Sij) 2 , 

^Oj-l ^0i-\ 


where In(-) represents the natural logarithm.! 

Decision Regions 


i + k, 
(6.3-16) 


Equation (6.3-16) is helpful in defining those specific values pj that 
correspond to a correct message decision. First, we observe that 2^1, 
(pj - Sij) 2 is the square of the Euclidean distance (norm) between the 
received waveform vector (p,, ..., p N ) and the vector (j n , ..., s iN ) of signal 
s t (t). For equally probable messages (6.3-16) is equivalent to deciding in 
favor of the message having the least distance between the received waveform 
vector and the message’s waveform vector. The region of all such vector 
points (pi, ..., p N ) in N-dimensional signal space that are closer to point 
(5/i, s iN ) than any other point is called the decision region for message 
i and is denoted by /, . Boundaries of /, for equally probable messages 
become perpendicular bisectors of lines drawn between signal points, as 
illustrated in Fig. 6.3-1 for M = 4 messages and N = 2 dimensions. When 
messages have unequal probabilities, boundaries are shifted according to 



Figure 6.3-1. Decision regions for four signals in two-dimensional signal space. 

t In reducing (6.3-14) the natural logarithm has been used for convenience; this is an 
allowable operation because the natural logarithm is a monotonic function of its argument. 
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(6.3-16) toward the signal vector having the smaller probability P, of each 
vector pair. 

Generally, the determination of decision regions is quite difficult. Only 
in some simple (but useful) situations can the regions be evaluated. For 
example, with equally probable messages and signal vectors on a rectangular 
lattice, decision boundaries are rectangular [4, p. 253], 

Error Probability 

The probability of symbol error, denoted by P w , is 1.0 minus the 
probability that a symbol is correct. The probability of a correct symbol 
is obtained by averaging all of the conditional probabilities of a correct 
symbol, denoted by P(C \ m,). 

Thus 

M 

P„= 1 ~'ZP{C\ m.j)P,. (6.3-17) 

1-1 

Since P(C | /n ; ) is the result of points (p,, ..., p N ) falling in decision region 

, we have 

= 1 — 2 Ri f ■■■ f Pr(pi< ■ ■■,Pn\ m i) dp, dp N . (6.3-18) 

i- 1 J h J 

In certain special cases where regions /, can be specified, (6.3-18) can be 
evaluated using (6.3-15). We subsequently consider some of these cases. 

Some Signal Constellation Properties 

We state some useful properties of signal vectors and decision regions 
when messages are equally probable, are transmitted on a white Gaussian 
noise channel, and M > 2. For additional discussion and proofs the reader 
is referred to Wozencraft and Jacobs [4], 

1. The probability that a received signal vector (p,, p 2 , p N ) falls in 
decision region /, corresponding to signal vector (s it , s l2 , ..., s iN ) is 
unaffected if the signal vector and /, are translated together through 
signal space. 

2. The probability in property 1 above is also unaffected by a rotation 
of /, about the signal vector of s,(t). 

3. The origin of signal space can be shifted to a new location that minimizes 
the average energy in the signals j,(r) without affecting the probability 
of symbol error. The new origin is the centroid of the signal constellation. 

4. Systems with the same signal constellations but different orthonormal 
functions will give the same minimum symbol error probability. 
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★6.4 OPTIMUM RECEIVERS 


Even though decision regions and error probability may be difficult to 
calculate, the structure of the optimum receiver can readily be found. By 
expanding the squares in (6.3-16) and using (6.1-20), the decision rule can 
be written as: choose in = m k if 

D k > Di , all i + k, (6.4-1) 

where we define 

D, = R, - B,, (6.4-2) 




In (Pi), 


(6.4-3) 


and 


N 

Ri = 2 Pj s a- 


J - ! 


(6.4-4) 


We refer to /?,, and Z), as response, bias, and decision variables, re- 
spectively. The optimum receiver structure depends on how R, is represented. 


Correlation Receiver Structures 

First, let s u in (6.4-4) be replaced according to (6.3-3) and then use 
(6.3-10) to get 

Ri= f P(t)s,(t)dt, i — 1,2, ..., M. (6.4-5) 

-'o 

The optimum receiver of Fig. 6.4-l(a) implements the decision rule (6.4-1), 
when Ri is represented as in (6.4-5). Because p(t) is just a specific value 

of r( r), we have chosen to replace it by r(t) in the figure to represent the 

fact that the receiver operates on r(t ) as shown, whatever its particular 
value. 

Alternatively, the response variables of (6.4-4) can be written differently 
by replacing p } according to (6.3-11) to obtain 

N fT 

Ri = 2 s u I p(t)<t>j(t) dt. (6.4-6) 

j- i ~'o 

The optimum receiver can now be put in the form shown in Fig. 6.4-l(b). 
This form of receiver may be more attractive than that in (a) when many 
signals with few dimensions are to be used, that is, when N « M. 

The difference in the two optimum correlation receivers lies only in 
the manner in which the response variables R, are generated. The receiver 
of Fig. 6.4-l(a) uses the fact that it knows the signals s,(t) available for 
transmission. The receiver of Fig. 6.4-1 (b) uses the set of orthonormal 
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MO a, 

■~<f~ j; ^ 

k = Value 
of i, Where D, 
Is Largest at 

*j(0 b 2 

1 = T 




**(') b m 


(a) 



(b) 

Figure 6.4-1. Optimum correlation receivers that (a) use the possible transmitted 
waveforms and (b) correlate against the orthonormal waveforms. 

functions found from the known signals s,(t) to define the coefficients s u 
needed in the matrix (these, of course, define the signal constellation). 
Both receivers can be simplified somewhat when all possible waveforms 
sAt) have the same energy and are equally probable. Then all biases B, 
are the same and the decision rule of (6.4-1) reduces to a test for the largest 
of the response variables R t . 


Matched Filter Structures 

It can be shown that filters with impulse responses, denoted by 
that are given by 


h,(t) 


s t (T - /), 

0 , 


0«r«7' 

elsewhere 


(6.4-7) 



(a) 





(b) 

Figure 6.4-2. Matched filter forms of receivers, (a) Receiver with impulse re- 
sponses h,(t) given by (6.4-7) that are equivalent to the receiver of Fig. 6.4-l(a). 
(b) Receiver with h,(t) defined by (6.4-8) that is equivalent to that in Fig. 6.4- 
1(b). 
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for i = 1,2 M, produce the identical responses to r{t) at the decision 

tune t- T as do the product-integrators in Fig. 6.4-l(a). On replacing 
these devices by the filters, system performance is unchanged and a matched 
filter form of receiver is obtained as illustrated in Fig. 6.4-2(a). As the 
reader may surmise, the name derives from the fact that (6.4-7) defines the 
white noise matched filter for signal r,(r). (See (B.9-8).) 

In a similar manner, filters defined by impulse responses 


h,(t) 


<t>i(T - t), Osisl 

0, elsewhere 


(6.4-8) 


can replace the product-integrators in Fig. 6.4-l(b) with no loss in performance 
The matched filter receiver is depicted in Fig. 6.4-2(b). 


! 


: 


★6.5 ilf-ARY AMPLITUDE SHIFT KEYING 

Much of the remainder of this chapter is concerned with specific signal 
constellations that illustrate the preceding optimum receiver concepts. One 
constellations is associated with M-ary amplitude shift keying 
(M- ASK), where a carrier having any one of M amplitudes can be sent 
during a symbol interval. 

Signal Constellation 


Let the M amplitudes A, be equally probable and uniformly separated 
by an amount A as defined by 


A, = A, + (/ - 1)A, i = 1, 2, ..., M. 

The transmitted signals become 

s,(0 = A, cos(o)„/ + 0 O ) = %/£)<£,(?), /' = 1,2, M, 

for 0 =s t « T, where 


(6.5-1) 

(6.5-2) 


E= *l 
1 2 


and only one orthonormal function is required as given by 

= y|cos(w 0 r + 0 O ). 


The applicable signal constellation is shown in Fig. 6.5-1. The vector 
of signal ls,(t) is just a point s n = y/E, in direction <i, . The distance between 
points, denoted by d, is found to be 



(6.5-5) 


i 

i 
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Figure 6.5-1. Signal constellation for M equally spaced Af-ASK signals. 


Decision region boundaries are located midway between adjacent pairs of 
points. 

System and Its Error Probability 

Of the four forms of receiver structure we choose to illustrate only 
that of Fig. 6.4-2(b) as shown in Fig. 6.5-2. Filter impulse response h,(t) 
is given by (6.4-8). Biases B, are given by (6.4-3) except the term involving 
jV 0 can be omitted because it is the same for all / (when messages are 
equally probable) and it does not affect the decision rule of (6.4-1). 

Average probability of symbol error P w is given by (6.3-18) using (6.3- 
15) with only one dimension. After considering property 1 of signal con- 
stellations given in Sec. 6.3 and the symmetry of decision regions in Fig. 



Figure 6.5-2. Optimum M-ASK receiver for waveform set defined by (6.5-1) 
and (6.5-2). 
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6.5-1, we see that 


J '- i-VE+W/2) . 

Pr(pi I m,) dpi = I -7=== e~ 0> !~ v ® 

1 1 -'Va-w/2) \/ttJv 0 

fd/2\or„ 

= 4=1 

\TT -s -d/lVjfis 

= 1 - erfc — 7 = , i = 
[n a similar manner 


VEi—(d/2) 

* rd/2Vjr 0 


1 - erfc 


£ -{p,-v^)W j pi 


(6.5-6) 


i = 2,3, ...,M - 1. 


In a similar manner 


J Pr\P 1 1 m,) dp, = 1 - - erfc |^^=J , I = 1 or M. (6.5-7) 
After substitution of these two expressions in (6.3-18), we have 


P = erfc 
*’ A/ 


M - 1 


rrr (6.5-8) 


We may compare (6.5-8) with the symbol (word) error probability in 
A/-ary PAM given by (4.9-15). By noting that A 2 T is the incremental energy 
per symbol in the PAM system while it is A 2 T / 2 in the ASK system, we 
find the two systems have the same symbol error probability if they both 
use the same increment of energy per symbol between levels. Since the 
ASK system can be viewed as a PAM system except with carrier modulation 
and demodulation operations, we conclude that no performance loss occurs 
due to the use of a carrier. 

Example 6.5-1 

As an example of M-ASK we compute the average transmitted power for signals 
defined by (6.5-2) when the M messages are equally probable. 

J M ^2 1 M 

Average power = — £ = ttz 2 [A, + (i - 1)A] 2 . 

M 2 2 M l= , 

On expanding out the square and using the known sums 
v . M(M + 1) 

Z"—r~ 


Si'- 


M(M + l)(2Af + 1) 


Average power 



A,A(M - 1)1 

1 

§ 

1 

J 

.4 

\ 2 / 

L 2 J 

L 12 J 


I 
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Example 6.5-2 

As a second example, suppose we find the amplitude A t that minimizes the average 
transmitted power found in the preceding example. The derivative of the average 
power is seen to be zero when 

, _ A(M - 1) 

A, 2 

The second derivative is positive so this value of A, does correspond to a minimum. 
From (6.5-1) we find the other extreme amplitude to be 

A(M - 1) 

A« - 2 

Thus amplitudes in M-ASK should have both positive and negative values, sym- 
metrically displaced about zero, to obtain minimum average transmitted power when 
messages are equally probable and amplitudes are uniformly separated. 

For the binary case, M = 2, the amplitudes are simply -A/2 and A/2. Of 
course, this corresponds to a carrier being amplitude modulated by a polar waveform 
format. Thus minimum power binary ASK is the same as binary PSK using a polar 
data waveform. 


★6.6 M - ARY PHASE SHIFT KEYING 

In M- ary PSK (Af-PSK) the phase of the transmitted signal can take on 
any one of M values in a given symbol interval while amplitude is maintained 
constant. Thus the signal set {«,(/)} can be defined by 


l cos(o > 0 r + 0 O + 0.)> 


os(«r 

elsewhere 


where 


(l — 1)2-77 


i = 1,2 M. 


( 6 . 6 - 1 ) 


( 6 . 6 - 2 ) 


Here we assume o> 0 is an integer multiple of 2ir/T and 0 O is an arbitrary 
constant phase. 

Signal Constellation 


By expanding (6.6-1) we have 

Sj(t) = A cos(0,)cos(to o t + 6 0 ) - A sin(0,)sin(co o t + 6 0 ) 
= ,y n c5i(f) + s a 4> 2 {t) 
where _ 

<#>i(f) = /- cos(o> 0 t + 0 O ) 


(6.6-3) 


;sin(o) 0 t + 0 o ) 
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5,i = \fE cos(0j) 

(6.6-6) 

s i2 = Ve sin(0,) 

(6.6-7) 

1 S— 1 1* 

bsr 

ii 

*5 

(6.6-8) 


The signal constellation applicable to (6.6-3) is shown in Fig. 6.6-1. The 
special case M = 8 is illustrated to demonstrate clearly the decision regions 
involved. More generally, these wedge-shaped regions are centered on the 
applicable signal vector, and the wedge’s total angle is 2ir/Mfor the equally 
probable signals assumed. All signal vectors lay on a circle of radius 
Vl because all signals are assumed to have equal energy over a symbol 
interval. 

Symbol Error Probability 

By applying properties 1 and 2 of signal constellations (Sec 6.3) and 
noting the symmetry of the decision regions in M-PSK, we find that all 
integrals in (6.3-18) are equal. Word (symbol) error probability is, therefore, 

r- r cpi — V^) 2 p11 Ar. 


J -oo /"pi tar 

J 

pl = 0 P2 m ~ 


pitan(7r/M) 


- (p,-v£) 2 p_n 

P L ^oj 
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In writing (6.6-9), use has been made of (6.3-15) with N = 2, since only 
two dimensions are involved and the integrals have been taken over /,. 
By simple changes of variables, (6.6-9) can be written as 

'•-= 1 -IL “ P H“- V3TJ \L 

( 6 . 6 - 10 ) 

Although the integral over f in (6.6-10) can be solved in terms of the 
error function, the remaining integral apparently has no closed form. It 
has been shown to have a good approximation [7, 11] for P„ < 10“ 3 and 
M > 2 given byt 


~ erfc 




(6.6-11) 


Numerical evaluations of (6.6-10) have also been made. From tabulated 
data given in Lindsey and Simon [7], we construct the curves of Fig. 
6.6-2. These curves are plotted as a function of 

e = - , (6.6-12) 

jfjogAMY 

which is the ratio of average energy per bit of information conveyed in a 
symbol to N 0 ■ Clearly, as one increases M to transmit more bits of information 
in a fixed bandwidth (same symbol duration), transmitted power (or energy 
per symbol) must increase to compensate for the increase in P„ that would 
otherwise occur. 


Example 6.6-1 

We compare (6.6-11) for M-PSK to (4.9-18) for M-PAM when M is large and the 
same for the two systems. From (6.6-11) 


• 2 log 2 (M) 


M» 1. 


From (4.9-18) 


P w = erfc 


3 log 2 (M) 


7: E M-PAM 


M » 1. 


For the same values of P w we require 


Sjwpam — j £m-psk — 3.290 ej/.psk. 

Therefore, the M-PAM system requires approximately 5.17 dB more average trans- 
mitted power than the M-PSK system for the same symbol rates, values of M and 

P„- 


t From bounds given in [12] it can be shown that th e right side of (6.6-11) is an upper 
bound on P w . A lower bound from [12] is (])erfc[\/£/A'o] < A - 
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Figure 6.6-2. Symbol (word) error probability of M-PSK system having equally 
probable messages. Curves plotted from data in [7] with permission. 


★6.7 ORTHOGONAL SIGNAL SETS 

A set of orthogonal signals {s,(f)} can be constructed by choosing orthonormal 
functions as the signals and assigning only one signal per coordinate in 
signal space. We have, then, as many signals as there are coordinates and 
M — N. We consider only the case of equally probable messages and 
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(6.7-1) 


equal energy waveforms given by 

for i = 1, 2, ..., M = N. Energy per interval is denoted by E. 

Figure 6.7-1 illustrates the orthogonal signal set’s constellation for the 
special case M = N = 3. 


Decision Rule and System 


Because of our assumptions, the biases B i of (6.4-3) are the same for 
all messages. They cancel in the decision rule of (6.4-1) so that the optimum 
decision occurs when the receiver selects message m k when k corresponds 
to the largest response variable. That is. A: is the value of i for which 

N 

R, = 2 Pfiu (6.7-2) 

7-1 

is largest. However, since signal i has a projection ojily into coordinate 
we have = 0 except when i = j where s tl = \JE. The decision rule 
reduces to finding which response 

Ri = PiS« = pi\/E (6.7-3) 

is largest. Finally, this rule is equivalent to choosing m = m k if 

pi > pi, all i f k. (6.7-4) 

A receiver based on this rule, as derived from Fig. 6.4-2(b), is shown in 
Fig. 6.7-2. 


Error Probability 


From symmetry in the signal set, the probability of a correct symbol 
decision, given a particular message is transmitted, is the same for all 



Figure 6.7-1. Signal vectors for M = 
N = 3 orthogonal signals. 
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Figure 6.7-2. Optimum receiver for M = N orthogonal, equally probable, equal- 
energy messages. 


messages. This fact means (6.3-17) can be written as 
P w = 1 - P(C | m k ) 


pXPii • Pn I m k) dp\ ••• d Ps 


(6.7-5) 


with m k being the message transmitted and the joint density given by (6.3- 
15) with i = k. Since signal k only has a projection in coordinate k, all 
Siy = 0 except s kk = Ve in (6.3-15) and 


p,(Pi> - --, Pa/ I i»k) = (^o) N/2 e 




-1 


n cx p(-F„)- 


(6.7-6) 


Equation (6.7-4) actually defines the decision region I k to be used in (6.7- 
5). For an observed response p k in the correct receiver channel, it says 
that all other channel responses p, , i ± k, must not exceed p*. Thus (6. - 
5) becomes 


exp(-p?/Xf 0 ) 


= 1 


J ~“ „-(p*-Ve) z /jVo n 

7=^ -n J 

p*-— Vtt^o 

iy* 

J "“ -(p»-vi) j /^o r p* e -pV^o "l ^- 1 

=r- -7= dp dp k 

p*=-“ V w -^o L-'p--“ wnJfo 


(6.7-7) 


or 


r , r lU'-i -(p-VB 2 /yo 

p -’ -2' ' rf te]} <6J ' 8) 

Although (6.7-8) has no known closed form, it can be upper-bounded 
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by [4, p. 266]t 

K « erfc[V£/2A' 0 ], (6.7-9) 

and has been tabulated [7], Figure 6.7-3 illustrates behavior of (6.7-8) as 
a function of 

E 

e = T7 i — ^T77 (6.7-10) 

yV o log 2 (M) 

using data from [7], 

It is interesting to note that the curves of Fig. 6.7-3 imply error-free 
performance for very large log 2 (Af) as long as e > 0.693 (or -1.59 dB). 
In practice such an ideal condition is not approached very closely. The 
limitation does not lie in excess required energy per symbol (see Example 
6.7-1), but rather in excessive required bandwidth which increases proportional 
to M (see Sec. 6.9). 


Example 6.7-1 

We find the required values of energy to produce P w = 2(10" 4 ) when log 2 (M) = 
1 (or M = 2) and log 2 (M) = 15 (or M = 32,768). From data in [7] or approximately 
from Fig. 6.7-3, we have e = 12.5 for M = 2 and e = 2.10 for M = 2 IS = 32,768. 
Thus 

e (for M = 32,768) = 2A0 = E (for M = 32,768) log z (2) 
e (for M = 2) 12.5 E (for M = 2) log 2 (32,768) 

or 


E (for M = 32,768) _ 2.10(15) 
E (for M = 2) ~ 12.5(1) 


2.52 (or 4.01 dB). 


The energy required per symbol with log 2 (M) = 15 is, therefore, only 2.52 times 
as great as that needed in a binary system when P w = 2(10" 4 ) and both systems 
use the same symbol duration. 


Simplex Signal Sets 

As noted in Sec. 6.3, the origin of signal space can be shifted to a 
new location to minimize the average energy in the signal set without 
affecting the symbol error probability when equally probable messages are 
conveyed over a white Gaussian noise channel. If this signal constellation 
property is applied to the equal-energy orthogonal signal set, the result is 
called a simplex signal set, which is optimum when energy is constrained 
[13; 4, p. 260], 

Let (s„, si 2 , ..., s M ) represent the vector of signal s,(t). Because 


t The bound becomes increasingly tight as £/jV 0 increases for fixed M : for N = 2 the 
bound is the exact result (equality applies) [7, p. 198], 
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is the energy in s,(r) (same for all i). Thus the vector of s t (t) is 

(s n> s t 2 , ..., s a , ..., s iM ) = (0, 0, ..., y/E 0). (6.7-13) 

The translation to achieve minimum energy is to the constellation’s centroid, 
found by statistically averaging over all signal vectors. This centroid, denoted 
by the vector (a,, a 2 , a M ), is given by (see Prob. 6-33) 


_ (Ve Ve Ve' 

(a u a 2 ,...,a M ) M 


(6.7-14) 


In the new (simplex) coordinates, let the signal vectors have components 
s-j*; then 

(-VE/M, j * i 




(6.7-15) 


(tf,...,^, ...,*&) = 


■ VS >-4 


Expressions for the new simplex signals become 


(6.7-16) 


^(r) = X agtyO, i = 1-2, .... M. (6.7-17) 

7“ I 

It can be shown (Prob. 6-35) that energies of signals in the simplex set are 
equal and given by 

E u> = £(1 - -M. (6.7-18) 


With M = 2, 3 and 4, for example, the savings in energy are 3.0, 1.76, 
and 1.25 dB, respectively; for large M savings become negligible. 

An added advantage of the simplex signal set is that its M signals 
span a space of only N = M — 1 dimensions, one less than the original 
orthogonal set. This fact becomes apparent when we note that the simplex 
signal vectors result from subtracting the centroid vector (the average over 
the orthogonal signal vectors) from each orthogonal signal vector. The sum 
over all simplex signal vectors is therefore the null vector, so any one of 
the simplex vectors can be expressed as a linear combination of the others, 
which reduces the number of dimensions by one. An example will help 
clarify these points. 


Example 6.7-2 

Consider the binary case where M = 2. The applicable signal constellation is shown 
in Fig. 6.7-4(a). The signals are given by 


r,(t) = VBMOl < 
sM = V®W») J ’ 


T, 



(a/' ‘ 0 ) = ( zz r ,,z r) 
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so the signal vectors are 

C 5 !!) ■Su) = (V£, 0) 

(■^21 » s 22) = (0, y/E). 

The centroid of these vectors is the vector 
as shown in the figure. 

We shift the origin to the centroid by subtracting the vector (a,, a 2 ) from the 
signal vectors. The new vectors, given by 


<«, vs y-t-;* 

are shown in Fig. 6.7-4(b). These vectors, according to (6.7-17), define the simplex 
signals, which are 


y/E y/E' 




- <hU)] 


sf(t) = = ~mt) - Ht)]. 

Clearly, the energy in either of these simplex signals is E/2, as predicted by (6.7- 
18). Thus the simplex set uses only half the energy of the orthogonal set. 

To show that the simplex set requires only one dimension (one less than the 
orthogonal set), we observe that the simplex signals can be written as 

aftO = 

s?U) = -J^U). 

Here we define the unit-energy orthogonal function 

tf'U) = ^W>i(0 - <W01- 

These expressions show that the two signals require only one dimension, as shown 
in Fig. 6.7-4(c). 


★6.8 M-ARY FREQUENCY SHIFT KEYING 

In M - ary frequency shift keying (M-FSK) the frequency of a carrier is 
keyed to one of M possible frequencies during any symbol interval of 
duration T. Amplitude is usually constant. If we consider only frequencies 
separated uniformly by an amount Ao>, the M-FSK signal set can be written 
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l cos[m 0 f + 0 O + (/' - l)Acof], 0 *£ t « T 
. elsewhere 


(6.8-1) 


for i = 1, 2, ..., M. 

If A and 0 O are constants, Ao> is an integral multiple of the symbol 
rate 2n/T, and o> 0 is a multiple of half the symbol rate, it is readily shown 
that the waveforms 

1 /^cos[<d 0 r + 0 O + (i - l)Ao>f], 0S(S7 
v T (6.8-2) 
q elsewhere, 

are orthonormal on [0, T], (Also see Prob. 6-36.) Thus (6.8-1) can be 
written as 

*M = i = 1, 2, ..., M, (6.8-3) 

where 


E = — . (6.8-4) 

From (6.8-3) the signal set is clearly orthogonal because only one 
signal is assigned to each of the M orthogonal coordinates. One form for 
the optimum receiver is shown in Fig. 6.7-2; the filters are defined by their 
impulse responses 

hiU) = s,(T- f) = \/£</>,(7 - t). (6.8-5) 

Because the M-FSK signal set is orthogonal, the symbol error probability 
is given by (6.7-8) for M equally probable signals. 


★6.9 QUANTIZED PULSE POSITION MODULATION 

Another example of an orthogonal signal set is quantized (M-level) pulse 
position modulation (M-PPM). PPM is generated by letting the positions 
of a pulse (relative to a nominal position) vary according to the amplitudes 
of a message that is quantized to have only M discrete levels. The PPM 
signal, therefore, has only M discrete time positions that may be occupied. 
Let the symbol duration 7 be the interval containing the M positions as 
shown in Fig. 6.9-1. The pulses 

4>M = rect 

are easily seen to be orthonormal. In terms of the set {<£,( 0 }, the available 
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0 T_ t 
M 

Figure 6.9-1. Pulse positions of orthonormal pulses in set (4>,(r)} and the pulse 

M). 


transmitter signals are 


where 


J ( (0 = A rect 






(6.9-2) 


(6.9-3) 


is the energy per pulse. 

Symbol error probability for M-PPM is given by (6.7-8) or Fig. 6.7-3 
for equally probable message levels where now 

= E A 2 T 

E ~ N 0 log 2 (M) " Jf 0 M log 2 (M) ' (6 ' 9 " 4) 


Interesting comparisons may be made between the orthogonal M-FSK 
and M-PPM systems. First, as M becomes large the bandwidths of both 
systems increase but in different ways. Bandwidth in the FSK system in- 
creases because M relatively narrowband (order of \/T Hz) waveforms are 
used that have different spectrum positions (separations about 1/7’ Hz)t. 
In PPM all transmitted waveforms in the signal set have the full bandwidth 
due to pulses of durations T/M (bandwidth M/7 Hz). Thus bandwidth in 
both cases is about the same and increases linearly with M. 

A second comparison of M-PPM with M-FSK considers the pulse 
amplitudes required to maintain a given performance level ( P „ ). For equal 


t For some values of phase, 6„, the separation can be as small as 1/27" Hz (see Prob. 


6-36). 
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values of M and P w the two systems require the same values of e. In the 
M- FSK system e is given by 

A 2 T/2 


(AT- FSK). 


(6.9-5) 


^ologj(M) 

It is given by (6.9-4) for M-PPM. On equating the two expressions for the 
same symbol durations T, we find 


— (forM-FSK) = —(for M-PPM). 
2 M 


(6.9-6) 


In other words, for a given performance in the M-FSK system, the peak 
pulse power A 2 in M-PPM must increase with M to maintain the same 
performance. This fact places severe constraints on the use of M-PPM for 
peak-power constrained applications, such as in satellite transponders. 


★6.10 BIORTHOGONAL SIGNAL SETS 

For a given number of signal space dimensions, N, a simple way to double 
the number of signals compared to an orthogonal set is to add an additional 
signal vector in each coordinate that is the negative of the orthogonal signal 
vector in that coordinate. Signal constellations of this type are called bior- 
thogonal signal sets. Figure 6.10-1 depicts a biorthogonal constellation for 
N = 2 dimensions and M = 4 signal vectors. 

Optimum Receiver 

To define the optimum receiver we first define the indexing of signal 
vectors. We define signals s t (t) and s 2 (t) by vectors (a/E, 0, ..., 0) and 
(- y/E, 0, ..., 0), respectively, in coordinate 1. In coordinate 2 we assign 
the vector of signal 3 to the positive axis and the vector of signal 4 to the 



negative axis. By continuing in this manner, coordinate j has the vector 
of the odd-numbered signal s 2J -i(t) assigned to its positive axis and the 
vector of the even-numbered signal s 2J (t) along the negative axis. With 
these assignments we have 


' Vf. 
-y/E, 


o, 


i = 2 j - 1 , 
i = 2 j, 
otherwise. 


j = 1, 2, ..., N 
j=\,2,...,N ( 6 - 10 ‘ 1 ) 


The optimum receiver can take on any of the forms of Figures 6.4-1 or 

6.4- 2. We elect to illustrate only that of Fig. 6.4-l(b). 

Because all signals $,•(/) have the same energy E and because all 
messages have been assumed equally probable, the biases B, of (6.4-3) are 
all the same. The receiver can, therefore, be implemented to test for which 
response variable /?, in Fig. 6.4-l(b) is largest. A further simplification 
follows the use of (6.10-1). Because every nonzero component s tJ has 
magnitude y/E, the receiver can simply test normalized response variables 
Rt/y/E for the largest. These considerations lead to the optimum receiver 
shown in Fig. 6.10-2. 

An alternative form of optimum receiver follows the reduction of Fig. 

6.4- 2(b) in a manner similar to the above. The final receiver is identical 
to that in Fig. 6.10-2 except the product devices and following integrators 
are replaced by matched filters defined by (6.4-8). 





Figure 6.10-1. Signal constellation for 
biorthogonal signal set when M = 4 
and N = 2. 


Figure 6.10-2. Optimum receiver for biorthogonal signal set. 
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Symbol Error Probability 

It can be shown that the symbol error probability P w is given by 


L • > • J 

J '*°° f r „ ]](«/j)-i -ip-vs 1 /* 

. i'-HAII ~^r dp - (6,0 - 2) 

No closed form is known for the solution of (6.10-2). However, an upper 
bound on P w is known to be [3, 4, 7]t 


M - 2 
2 


:[ /-§-] + ^ erfc f l~ 

LV2^ 0 J 2 LV^o 


(6.10-3) 


and P w has been numerically evaluated and tabulated [7], The behavior of 
P w is illustrated in Fig. 6.10-3 as plotted from the tabulated data. When 
these curves for M biorthogonal signals are compared with those of Fig. 
6.7-3 for M orthogonal signals, we find nearly the same performance when 
e is large and log 2 (M) 3 2, but the biorthogonal signal set requires only 
half the number of dimensions required by the orthogonal set. 

QPSK Example 

When M = 4 both Af-PSK and the biorthogonal signal sets are equivalent 
to the signal set of QPSK. All three analyses must, therefore, produce the 
same word error probability. For M = 4 minor changes of variables will 
show that (6.6-10) for A/-PSK and (6.10-2) for the biorthogonal sets are the 
same. These functions are known to have the solution [7] 




(6.10-4) 


To show that (6.10-4) is truly the same as in QPSK, we recall from 
Chap. 5 that average bit error probability was given by (5.4-5). On recognizing 
that average energy per symbol in QPSK is E - 2 A 2 T b $ we write (5.4-5) 


P - = - erfc 


[ /*- 1. 
LV2A0J 


(6.10-5) 


Now the probability that a word is in error is one minus the probability 
that both bits on the independent quadrature QPSK channels are correct, 
which is (1 - P e f. Thus 

P„ = 1 - (1 - P e f = 2 P e - P\. ( 6 . 10 - 6 ) 

On substituting (6.10-5), (6.10-6) is seen to equal (6.10-4), as it should. 

t The bound is increasingly tight as E/,V 0 increases for fixed U. 
t In Chap. 5 the transmitted waveform’s peak amplitude was \/2A, so peak power is 
2A 2 /2 = A 2 ; energy per symbol of duration T = 2 T b becomes A l 2T b = E. 
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-10 -5 0 5 10 15 


e A^ 0 log 2 (M) dB 

Figure 6.10-3. Word error probability for optimum system using a biorthogonal 
signal set for equally probable messages and a white Gaussian noise channel. 
Curves are plotted from data in Lindsey and Simon [7] with permission. 


★6.11 VERTICES OF HYPERCUBE SIGNAL SETS 

As a final example of optimum M-ary systems, we consider one in which 
equally probable signal vectors lie on the comers of a hypercube centered 
on the origin. Because an N-dimensional hypercube will have 2 N comers, 
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we constrain the problem to the case where M = 2 N messages are used. 
Fig. 6.11-1 illustrates the geometry involved for N = 3 and M - 8 signals. 

All vectors are equally distant from the origin and will have equal 
energies per symbol ( E ). Signal vectors are defined by 

« = 1.2, ...,M = 1 N (6.11-1) 

where 


SiJ=± Jjj’ all,and 7- 

Actual transmitted signals are, of course, given by 

N 

SM = 2 SiMO, i= 1,2, ...,M = 2 n . 


(6.11-2) 


(6.11-3) 


j - 1 


System and Its Error Probability 

The optimum system using a vertices of hypercube signal set can 
employ any one of the optimum receivers of Figs. 6.4-1 or 6.4-2 with the 
biases B, neglected (set to zero). The biases can be ignored because of our 
assumption of equal energy signals and equal probability messages. 

Symbol error probability is especially easy to obtain for this signal 
set. A given message is identified correctly in the receiver if the corresponding 
signal j,-(r) through its signal vector is correctly identified. The identification 
amounts to identifying correctly the applicable hypercube corner. Since 
every corner has a component of its vector in every coordinate, its correct 
identification amounts to all coordinates being correctly demodulated. 
However, since the noises in all the coordinates are independent with the 



Figure 6.11-1. Signal vectors on vertices of three-dimensional cube (case of 
M = 8, N = 3). 
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same noise powers, the probability of all coordinates being correctly de- 
modulated is (1 - p ) N , where p is the probability that a given coordinate 
is not demodulated correctly. Hence the probability of a symbol (word) 
error is 

P* = 1 — (1 — pf- (6.11-4) 

Probability p is recognized as the average probability of erro r in a binary 
decision (one coordinate) involving signals located at ±\/E/N. As in polar 
formatting, this probability readily evaluates to 



r 


(6.11-5) 


( 6 . 11 - 6 ) 


(6.11-7) 


for £/A" 0 log 2 (Af) large. 

Polar NRZ Format Example 

The vertices of a hypercube signal set has a direct application to the 
polar-NRZ waveform format. Consider M message levels encoded with N- 
bit binary codewords, where the polar waveform has rectangular bit pulses 
of amplitude ±A and duration T b . A symbol, or word, has duration 

T = NT b (6.11-8) 

and energy 

E = A l T = A 2 NT b . (6.11-9) 

A codeword waveform, s,(f), can be expressed as 

~t - O' - 0.5)7; 


sM = 2 afreet 

i - 1 


( 6 . 11 - 10 ) 


where 

a,j=±A (6.11-11) 

with the sign corresponding to data bit j in codeword i. To illustrate these 
points for the case M = 8, N = 3, we include Table 6.11-1, where coefficients 
a u /A are defined for the eight signals s,(f). 

To express (6.1 1-10) in terms of signal vectors and orthonormal functions, 
we write 


sM = 2 s,j4>j(t), 
j - 1 


(6.11-12) 
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TABLE 6.11-1. Codewords, Signal Vectors and Coefficients a u for M 
N = 3 Dimensions 


8 Vectors in 



Data 

Codeword 


a u / A for j Shown 

i 

Signal Vector i 

1 

2 

3 

i 

000 

( -VWn , -VWn, -VWn) 

-1 

-1 

-1 

2 

001 

(-VWn, -VWn, VWn) 

-1 

-1 

1 

3 

010 

(- VWn , VWn, -VWn) 

-1 

1 

-1 

4 

011 

(-VWn, Ve/n, VWn) 

-1 

1 

1 

5 

100 

( Ve/n, -VWn, -Ve/n) 

1 

-1 

-1 

6 

101 

( Ve/n, -VWn, VeJn) 

1 

-1 

1 

7 

no 

( VWn, VWn, - VWn) 

1 

1 

-1 

8 

in 

( VWn, VWn, VWn) 

1 

1 

1 


where 


= 



t-(j ~ 0-5 )T b 
T b 


j = 1, 2 N (6.11-13) 


s,j = ±y / A 2 T b = alii and J. (6.11-14) 

The sign of follows that of a u . Signal vectors are, of course, given by 
(6.11-1). Again we refer to Table 6.11-1, this time for construction of example 
signal vectors in the case M = 8, N = 3. These vectors correspond to 
the numbering in Fig. 6.11-1. 

The preceding work shows that the polar-NRZ waveform is an example 
of a vertices of a hypercube signal set. This fact means that (6.11-6) must 
give the word error probability. Since E/Jf 0 \og 2 (M) = E/Jf 0 N = 
A 2 T b /./V 0 , it is clear that p in (6.11-5) is just P„, the bit error probability 
in the polar-NRZ system from (4.4-7). Thus 


P w = 1 -[1 -P e ] N , (6.11-15) 

which was previously found in (4.5-1). There is complete agreement seen 
between the vector approach taken here and the approach of the earlier 
work. 


6.12 SUMMARY AND DISCUSSION 

M-ary systems are the main topics of this chapter. These are systems in 
which the digital source is allowed to generate any one of M discrete 
messages in any one symbol interval. For the first time in the book the 


signal space, or vector, method of representing signals and noise is introduced. 
The method is a slightly more advanced analysis technique than used elsewhere 
in the book, but it has far more power. The general M- ary coherent system 
and the vector representation method are described in Secs. 6. 0-6. 2. 

By using the new analysis method, the optimum M-ary digital system 
is found. It is defined as the system that produces the largest probability 
of making a correct decision, in any given symbol interval, as to which 
symbol is being received at the receiver. The optimum decision rule is 
developed in Sec. 6.3 that leads to the optimum receiver (Sec. 6.4). The 
optimum receiver has both a correlation form (Fig. 6.4-1) and a matched 
filter structure (Fig. 6.4-2). The average probability P w of making a symbol 
error is given by (6.3-18) which can only be solved further for specific 
systems. 

The remainder of the chapter explores specific examples of the optimum 
system. M-ary ASK (M-ASK) is developed in Sec. 6.5 for equally probable 
messages. The matched filter form of receiver is shown in Fig. 6.5-2. 
Symbol (word) error probability for the M-ASK system is given by (6.5- 
8). For the same increment of energy between (equally spaced) message 
levels, M-PAM, the baseband system of Chap. 4, and M-ASK give the 
same values of P w . 

M-ary PSK (M-PSK) is a constant amplitude waveform with M possible 
phases, one of which is used in each symbol interval. A good approximation 
for P w is defined by (6.6-11). The exact values of P w are shown in Fig. 
6.6-2. The curves show that significant increases in energy per symbol are 
needed to maintain P w at a specified level as M increases. In fact, for fixed 
P w and symbol duration, the increase in energy is proportional to l/sin 2 (7r/M). 
Thus significant increases in energy (or power) are necessary in M-PSK to 
maintain a given performance level (value of P w ) as the information rate 
is increased. 

In Sec. 6.7 the orthogonal signal set is introduced. The set defines a 
class of possible real transmitted waveforms i = 1,2 , ..., M, that 
are nonzero only in the symbol interval of duration T that satisfy the 
orthogonality relation 

f s,(0sj(t)dt= •[$,’ l t J (6.12-1) 

Here £, is the energy in $,(f). In the orthogonal signal set all signals have 
the same energy £, = E. A system using an orthogonal signal set has the 
optimum matched filter form of receiver shown in Fig. 6.7-2; its average 
symbol error probability is given by (6.7-8) which is plotted in Fig. 6.7-3. 
The curves promise error-free performance as M —* oo. Unfortunately, 
this limit is difficult to approach in practice because the required channel 
bandwidth becomes infinite. 

An example of an orthogonal signal set is M-ary FSK (M-FSK) discussed 
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in Sec. 6.8. Another example is M-level pulse position modulation (M- 
PPM) discussed in Sec. 6.9. These two sets of waveforms behave quite 
differently in a practical sense, but, since they are both orthogonal signal 
sets, their symbol error probabilities are the same. It is demonstrated in 
the text, for example, that M-PPM would not be suited to peak-power 
constrained systems such as a satellite link, whereas A/-FSK could be a 
possible candidate for such a link. 

In Sec. 6.10 the biorthogonal signal set is introduced. It is a means 
of doubling the number of signals in a set by properly combining two 
orthogonal signal sets. The optimum biorthogonal receiver is shown in Fig. 
6.10-2 in its correlation form. Average symbol error probability P w , as 
given by (6.10-2), is plotted in Fig. 6.10-3. It is found for large M that the 
biorthogonal system performs nearly the same as the orthogonal system 
for small P„. The QPSK system of Chap. 5 is an example of a biorthogonal 
signal set when M = 4. (Although QPSK was discussed as a binary system 
in Chap. 5, it is in reality a 4-ary system. The earlier binary approach was 
possible only because of the special parallel structure of the system that 
made it possible to examine two binary paths more or less separately.) 

The signal set called vertices of a hypercube derives its name from 
the geometric appearance of the signal vectors used to define the set. This 
signal set is important because it defines the TV-bit binary code. Of course, 
it is an AT- ary set, and, in the case of the binary code there are M = 2 N 
possible codewords. Symbol error probability for the vertices of a hypercube 
set is given by (6.11-6). 
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PROBLEMS 

★6-1. Extend Example 6.1-1 by proving that the waveforms in the orthonormal 
set {\Jl/T cos(n27rt/T)} are also orthonormal to the waveforms in the or- 
thonormal set {y/2/f sin( nlirt/T)}. 

★6-2. Express the Fourier series and coefficient formulas for a periodic signal, as 
given by (A.4-3) and (A.4-4), in terms of the orthonormal functions of Prob. 
6 - 1 . 

★6-3. (a) Show that the exponential signals 

<ft„(0 = (l/VT)exp ^' ,> ' 27r< j, n = 0, ±1, ±2, ... 

are orthonormal on the interval [0, T], 

(b) Write the Fourier series and coefficient formulas for a periodic signal, 
as given by (A. 4-6) and (A. 4-7), in terms of the signals {<£„( t)}. 

★6-4. Define orthonormal functions and coefficients so that a bandlimited function 
represented by the sampling theorem of (2.1-11) can be written as 


f(t) = 2 s k4>k(t). 


(Hint: Use results from Example 2.1-1.) 

★6-5. Given two sinusoids Acos(co 0 t + 0 q) and Bsin( io 0 l + 6 0 ), where A, B, and 
0 O are real, but otherwise arbitrary, constants. Find constraints on u> 0 such 
that the two waveforms are orthogonal on the interval [0, T], 


★6-6. Define N functions 


<hn(t) = 



- [(2/i - l)T/2N] 
T/N 


orthonormal on [0, T], 


n = 1, 2, ..., TV 


M-ar y Digital Systems Chap. 6 

(a) For the case A = 2, express the four waveforms of a unipolar waveform 
format of a 2-bit natural binary code in terms of the functions <f> n (t). 

(b) Plot the signal constellation of all waveform vectors. 

Work Prob. 6-6 except for a polar format. 

★<-8. Work Prob. 6-6 except for a 3-bit code. 

★<-9. Work Prob. 6-6 except for a 3-bit code and polar format. 

★6-10. Given three waveforms as shown in Fig. P6-10. Find orthonormal functions 
to describe this signal set by use of the Gram-Schmidt procedure. Plot a 
signal constellation of the signal vectors. 



Figure P6-10. 


★6-11. (a) Find orthonormal basis functions to describe the waveforms of Fig Ph- 
il. 

(b) Plot the signal constellation. 

(c) Find the energies in the signals by direct integration and by use of (6.1- 
20). Do they both agree? 



Figure P6-11. 


★6-12. Work Prob. 6-11 except for the waveforms of Fig. P6-12. 
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Figure P6-12. 


★6-13. Assume (6.2-2) is true and prove that (6.2-3) is valid. 

★6-14. From Prob. 6-13 assume (6.2-3) is true and prove that (6.2-4) is true. 

★6-15. Prove that (6.2-5) is true if (6.2-1), (6.2-3), and (6.2-4) are true. (Hint: Use 
Mercer’s theorem [10], one form of which states that 

N 

R x (0) = lim 2 klU(0| 2 . 

N-*°° „-| 

★6-16. Use Bayes’ theorem to show that (6.3-13) is true and when used in (6.3- 
12), (6.3-14) results. 

★6-17. Justify (6.3-15) and then use it to show that the MAP receiver’s decision 
rule can be written as in (6.3-16). 

★6-18. A binary system uses M = 2 signals defined on an N - 2-dimensional 
signal space. By solving the MAP receiver decision rule, find exactly what 
received waveform vector points (p,, p 2 ) fall in decision regions I t and / 2 . 
Assume P, = P 2 - 

★6-19. Work Prob. 6-18 except assume P, P 2 . 

★6-20. Equal-probability binary messages are defined in two-dimensional space by 
vectors (-2, 2) and (2, 2). If A" 0 = I for a white noise channel, define 
decision regions I, and I 2 and use (6.3-18) to compute P w . 

★6-21. Draw a block diagram of an optimum receiver of the form of Fig. 6.4-1 (b) 
for the system of Prob. 6-20. Reduce the receiver to its simplest form. 

★6-22. In a particular full duplex modem (separate wires for transmission and 
reception), M is large while N is small, so the receiver is implemented in 
the form of Fig. 6.4-l(b). Since the waveforms {4>j(t)} must be generated 
for receiver use, draw a block diagram to show how they can also be used 
to generate the transmitter signals {r,(t)}. 

★6-23. A transmitter to be used in an M-ASK system has peak and average power 
limitations of 10 W and 3.5 W, respectively. For M = 16 equally probable 
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messages and a constraint on pulse amplitudes of A, » 0, all / = 1,2 

M, select the A, to best utilize the transmitter. What are the final transmitted 
peak and average powers? Is the system most affected by peak or average 
power limitations? 

★ 6 - 24 . Work Prob. 6-23 except ignore the constraint A, 3= 0 and let A, = -A u - 
★ 6 - 25 . An ASK system uses signal amplitudes A, = 0, A, = 0.5, A, = 1.0, and 
A 4 = 1.5 V. If the symbol duration is 10.0 ms, what is the largest allowable 
channel noise density, Af 0 / 2, that will permit a word error probability 
P w = 5.164(10' 4 )? 

★6-26. When the lowest-amplitude signal is zero in M-ASK (when A, = 0), the 
decision variable D, in Fig. 6.5-2 is identically zero. Give arguments to 
justify that message 1 will be correctly selected as transmitted (m = m,) 
provided that n u the noise on p, , is less than \/E,/2 for all i = 2, 3, ..., 
M. 

★6-27. In an M-PSK system P w = 10 4 when M = 4. 

(a) What is the required ratio of energy per symbol ( E ) to jV 0 ? 

(b) If P w is maintained constant and the same channel is used (same A 0 ) 
but M is increased, compute data and plot a curve of the increase in 
energy £ required for M = 8, 16, 32, and 64. 

★ 6 - 28 . In a manner similar to Example 6.6-1, compare e required in an Af-PSK 
system with that of an M-ASK system with A, = - A M to produce the same 
word error probabilities P w when M is large. 

★6-29. An M-ASK system uses M = 32 signals, transmits an average power of 
140 W when amplitudes are A, = 0 and A, > 0 for ; > 1, uses symbols of 
duration T = 1000 /as, and operates on a channel for which jV 0 = 9.31(10" 5 ) 
W/Hz. An M-PSK system uses the same channel, has the same symbol 
duration and peak power, and has the same symbol error probability as the 
M-ASK system. 

(a) Find P„ for the two systems. 

(b) What is the peak transmitted power in the ASK system? 

(c) What is the largest (binary) number of signals (M) that the PSK system 
can use while satisfying the given constraints? 

★ 6 - 30 . A 32-phase M-PSK system transmits symbols of duration 35 fis over a 
channel with white noise power density J<f 0 /2 = 8.89(10' 7 ) W/Hz. What 
minimum transmitter average power must the symbol waveforms have if 
symbol average error probability must not exceed 7. 43(10" 7 )? 

★ 6 - 31 . In an M-PSK system M = 16. What minimum value of e is required to 
guarantee P w « 3.16O0- 5 )? 

★ 6 - 32 . Consider an orthogonal signal set with M = 2 (binary case). 

(a) Define the decision regions I t and / 2 when messages are equally probable. 

(b) Use (6.7-5) to find P w . Compare your result to P e found in (5.5-11) for 
coherent binary orthogonal FSK. (Hint: Change integration variables 
by means of a coordinate rotation of tr/4 rad.) 

★ 6 - 33 . For an orthogonal signal set corresponding to equally probable messages, 
determine the mean signal vector and show that it is given by (6.7-14). 


★6-34. Three signals, each with energy per symbol of 9, belong to an orthogonal 

signal set. 

(a) Find the vector for the origin shift required to form a simplex set from 
the orthogonal set. 

(b) Shift the origin and find the vectors of the three signals in the simplex 

set. \ . 

(c) Write expressions for signals in the simplex set. Show that the third 
signal can be written in terms of the other two as 

s?'(t) = ais'A/) + a 2 sf(t). 


What are a, and a 2 ? 

★6-35. Show that the energies in the signals of a simplex set are equal and given 
by (6.7-18). 

★6-36. If 0 O equals any integer multiple of n/2, show that the waveforms of 
(6.8-2) are orthonormal on [0, T] if <u 0 and Aoj are positive integer multiples 
of one quarter and one half the symbol rate 2tt/T, respectively. 

★6-37. In a discrete PPM system, 32 pulse positions are possible per symbol interval. 
If word (symbol) error probability must satisfy P w =£ 1.152(10 ! ), find the 
smallest value of e that is allowed. For the same M and P w how would 
peak amplitudes of symbol waveforms of an M-FSK system compare to 
those of the M-PPM system? 

★6-38. For equally probable messages draw a block diagram of an optimum receiver 
for a biorthogonal signal set that is based on reduction of Fig. 6.4 2(a). 
Reduce the diagram to its simplest possible form. 

★6-39. Assume the bounding expressions, (6.7-9) and (6.10-3), for P w in orthogonal 
and biorthogonal systems are accurate for computations of P w when £/.A 0 
is large. Give arguments to support the fact that P w in the biorthogonal is 
smaller than that in the orthogonal system for the same E/N 0 and M according 
to 

/^(biorthogonal) = /^(orthogonal) - 

★6-40. Show that (6.6-10) and (6.10-2) are the same when M = 4. 

★6-41. A binary system using a polar-NRZ format collects N = 4 pulses (bits) 
over a time T = 64 /us and transmits the group as a “word." If word error 
probability is = 3.2(10 -2 ) when pulses of amplitude 2 V are used, what 
is the channel’s noise density, AC 0 /2? 
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A. 0 INTRODUCTION 

Waveforms may broadly be classified as deterministic or random. A de- 
terministic waveform is one that can be completely described by an equation, 
a graph, or other definitive manner. A random signal is one in which 
probability theory is required to describe characteristics of the waveform; 
the signal itself cannot be precisely described. In this appendix we summarize 
the most important characteristics of deterministic waveforms and linear 
networks to process them. Random signal theory is reviewed in Appendix 

B. 

A deterministic waveform can also be classified as a power signal or 
an energy signal. A power signal has finite power when averaged over all 
time; it, therefore, also has infinite energy. An energy signal has finite 
energy but zero power based on averaging over all time. We consider both 
forms of signal. 


A.1 ENERGY SIGNALS 

Signals are usually described either in the time domain or in the frequency 
domain by their Fourier transform. 
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Fourier Transforms 

An energy signal/ft) and its Fourier transform F(o>) are related through 
the Fourier transform pair 

F(to) = f f(t)e~ M dt (A. 1-1) 

f(t) = ~ f F(<o)e Ja " dw. (A. 1-2) 

Z 7 T J _oo 

If fit) is a voltage waveform, Fiat) is a voltage density with the unit V/Hz. 
It represents the relative voltage amplitudes (complex in general) of the 
frequencies that constitute fit). Equation (A. 1-2) is called the inverse Fourier 
transform. We sometimes use notation &{■} and 9~ l {-} to represent taking 
the direct (Fourier) transform and the inverse transform, respectively. The 
double arrow is also used to imply a pair. Thus 

9 - '{F(o>)} = fit) 4-» Fiat) = 9 {fit)}. (A. 1-3) 

Under reasonable conditions [1, p. 93] energy signals will always have 
Fourier transforms. In fact, most waveforms of practical interest, both 
energy and power, will have transforms. 


Properties of Fourier Transforms 


We give some very useful properties of Fourier transforms without 
proofs. Other properties are given in [2]. We assume waveforms fit) and 

f„it) for n = 1,2 N have Fourier transforms F(a>) and F„(oi) for n = 

1, 2, ..., N, respectively, while o) 0 and t 0 are real constants and a„ for 
n = 1,2, ..., N are possibly complex constants. The time waveforms may 


be complex. 

The linearity property is: 

N N 

fit) = X «„/„(0 <— ♦ X a„F„M = F(a>). 


The time and frequency shifting properties are: 
fit - t 0 ) — * F{oj)e- JM ° 


fit V* — «• 

The convolution properties are: 
fit) = f fir) fid ~ r) dr 

•j — oo 

/*oo 

fit) = mm «— ^ J _ 


Fiio - (o 0 ). 

< — ♦ F,ia))F 2 (co) = F(w) 
F,H)F 2 (co -f)df = Fiw). 


(A. 1-4) 

(A. 1-5) 
(A. 1-6) 

(A. 1-7) 
(A. 1-8) 
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Thus convolution of two time domain functions corresponds to the product 
of their two spectrums in the frequency domain. Conversely, the time 
domain product of two functions corresponds to the convolution of their 
two spectrums divided by 2 ir. 

The correlation properties are: 

f(t) = f ft (T)f 2(7 + t) dr < — * Ff(tt)F 2 (ai) = F((d) (A. 1-9) 

mJ — <SO 

/* OO 

fit) = fXimf) WOFtf + co)dt = Fiai), (A. 1-10) 

where the asterisk represents complex conjugation. 

Another property can be called Parsevais theorem and derives from 
(A. 1-9) with t = 0: 


J' f\(r)h(f)dT Ffito) F 2 (o>) do. 

Clearly, when/,(f) = f 2 it) = fit), 

J |/W| 2 dt = J |F(o»)f dw, 


(A. 1-11) 


(A. 1-12) 


which is very important because it shows that the energy in fit) can be 
obtained by either a time or frequency domain integration. 

From (A. 1-1) and (A. 1-2) we have the area properties: 


Fi 0) = f fit) dt 

/* 00 

/( 0 ) = J Fico) dai. 

Energy and Energy Deneity Spectrum 

If E denotes the energy in/(t), then from (A. 1-1 2), 


(A. 1-13) 


(A. 1-14) 


%i<o)da (A. 1-15) 


where we define 

g(w) = |F(o>)| 2 (A. 1-16) 

as the energy density spectrum. Clearly the unit of $(<w) is energy per hertz 
if fit) is a voltage or current waveform, t 


t As usual in communication systems theory, we either presume 1-tl real impedances 
or assume voltages and currents are in normalized form so that impedances do not show 
explicitly in energy or power expressions. 
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A.2 SOME USEFUL ENERGY SIGNALS 

Two energy waveforms occur so frequently that they are defined and included 
here for reference. 


Rectangular Function 

The rectangular (pulse) function rect(-) is defined by 

/..—(Hi; <a ' 2 -’ 


where we also define the sampling function Sa(-) by 


i sin(x) 
Sa(.c) = — — • 


(A. 2-2) 


The function rect(f/r) and its spectrum are illustrated in Fig. A. 2-1. 

Triangular Function 

We define the triangular function tri(-) by 


The function and its spectrum are shown in Fig. A.2-2. 


= F(g>). (A. 2-3) 
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(a) 



_4n _2ir 0 2j 4ir u 

T T T T 


(b) 

Figure A.2-2. (a) The triangular function and (b) its Fourier transform. 


A.3 POWER SIGNALS 

As noted earlier, a power signal has a finite power when averaged over all 
time. In this section we introduce additional details about power signals. 

Average Power and Power Density Spectrum 


We define 


m = lim r|- f [•] 

7 *" >°P ^ i •/ _ T 


as the infinite-time average of the quantity within the brackets. For a 
voltage waveform /(f), we have 

■sfl/Wl = lim f /(f) dt, 


(A. 3-2) 


A.4 Periodic Power Signals 
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which is the dc component of /(f), or 

P = ^[|/(f)| 2 ] = lim I l/(f)| 2 dt, (A. 3-3) 

T-*oo LI J _ j- 

which is the average power P in /(f). If /(f) is a real waveform, |/(f)| 2 is 
replaced by f\t) in (A. 3-3). 

Average power can also be found from a frequency domain operation. 
Let/ r (f) represent /(f) truncated to the interval [-T, I] and let its Fourier 
transform be F T (a) according to 



frit) = f(t) rect(~j <— ■* Ffa). 

(A. 3-4) 

It can be shown [3, 

p. 41] that 

/*oo 



P = T~ 1 Wo) da, 
2t r J 

(A. 3-5) 

where 

W * lim {*%*} 

(A. 3-6) 


is called the power density spectrum of /(f). The function .9(a) has the 
unit power per hertz. 


Time Autocorrelation Function 

The time average of the product f*(t)f(t + r), defined by 

£%(r) = lim f f*(t)f(t + r) dt , (A. 3-7) 

T-* °o -J - T 

is called the time autocorrelation function of /(f). It can be shown that 
S%(t) and if (a) are related through the Fourier transform, that is, 


:«») - f 

0l(r)e~ Ja " dr 

(A. 3-8) 


.9(a)e J “" da, 

(A. 3-9) 

mr ) * 

Wo). 

(A. 3-10) 


A.4 PERIODIC POWER SIGNALS 

The periodic signal is one of the most important types of power signal. 
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A.5 Signal Bandwidth and Spectral Extent 
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Fourier Series 


A periodic signal /(f) with period T p satisfies the relationship 

fit) = fit + kT p ), k = ±1, ±2, .... (A. 4-1) 

and can be expressed in terms of an infinite series of time functions known 
as a Fourier series. Two principal forms of Fourier series may be stated. 
The trigonometric form is 


where the 



Fourier series 


+ 2 fl„cos(no) p f) + 2 b n sm(na> p t), 

»"1 n-l 

coefficients are given by 


a n 

b n 

with 


t)(. 
£ 


<o + (7>/2) 


(T p / 2) 

>o + (Tp/2) 


lo-(T p /2) 


f(t)cos(n<i) p t) dt, 
/(f)sin(na>p/) dt, 


n = 0, 1,2, 

n= 1, 2, .... 


2ir 

&)„ = — 
p T 

*p 

and t 0 is any real constant. 

The complex or exponential Fourier series is 


(A. 4-2) 


(A. 4-3) 
(A. 4-4) 

(A. 4-5) 


fit) = 2 C„e j ^' 


where 



to+(r,/2) 
~(Tp/ 2) 


Me-** dt. 


n = 0, ±1, ±2, .... 


(A. 4-6) 


(A. 4-7) 


Impulse Function 


To define and deal with the spectrum (Fourier transform) of a periodic 
signal it is necessary to introduce the concept of an, impulse function. The 
unit impulse function, sometimes called a delta function, denoted by 6(f), 
is defined through its integral property 



4>(t)8(t - to) dt = f {to). 


(A. 4-8) 


Here f 0 is a constant and <f>(t) is arbitrary except it is assumed continuous 
at f = t 0 . The function 8(f — f 0 ) behaves as though it occurs at / = f 0 , 
has zero duration, infinite amplitude at t = t 0 , and area of unity. Other 
forms of impulses also exist, and discontinuous functions <6(f) can also be 
handled [4], 


An impulse function A8(t - f 0 ) is a unit impulse (area one) scaled by 
the amplitude constant A. It is usually shown graphically as a vertical 
arrow at time t = f 0 with amplitude A. Of course, the arrow points to 
infinity to imply infinite amplitude, whereas the plotted amplitude indicates 
the scale constant A. 

By use of (A. 4-8) the Fourier transform of the impulse function is 
readily established 

8(f) *— » 1 (A. 4-9) 

A8(f - t Q ) «— » A e- Jw '\ (A. 4-10) 

Other useful properties of impulse functions are, with a> p = 2ir/T„, 

2 8(f - nT p ) < — > 2tt 2 f^r^8(a> - na> p ) (A.4-11) 

n--o o na-co \* p/ 

2 bit ~ nTf) = W 2 e Jna f (A. 4-12) 

«=-» \ ■*/>/« = - OO 

Spectrum of a Periodic Signal 

By Fourier transforming /(f) of (A.4-6), we obtain the spectrum of a 
periodic signal 


F{w) = 2 tt 2 C,,8(a> - nco p ). (A. 4- 13) 

ft = — OO 

Power Spectrum, Autocorrelation Function, 
and Power 

The power spectrum of the periodic signal is 

£f((o) = 277 2 |C„| 2 8(oj - noi p ). (A.4-14) 

At = — oo 

By inverse Fourier transformation the autocorrelation function is 

&tiT) = £ \Cjfe*- r . (A.4-15) 

At = -oo 

Average power results from use of (A.4-14) in (A.3-5) or from (A. 3-7) with 
r = 0: 

^=2 |C„| 2 . (A. 4-16) 

n “= — oo 

A.5 SIGNAL BANDWIDTH AND SPECTRAL EXTENT 

Many definitions of bandwidth exist. Generally, bandwidth refers to the 
band of frequencies that are most important (largest amplitudes) in defining 
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the signal. Spectral extent refers to the band of frequencies outside which 
the spectral components in a signal may be considered negligible. 

Thrae-dB Bandwidth 

The band of positive frequencies over which the magnitude of a signal’s 
spectrum remains above l/\/2 times its value at a convenient reference 
frequency, usually located in the band, is called the 3-dB bandwidth. Fig. 
A. 5-1 illustrates 3-dB bandwidth, denoted by W in radians per second, for 
several spectral shapes. The reference frequency for a baseband (lowpass) 
signal, as shown in (a), is usually zero, although it could also be another 
convenient value (1000 Hz for the audio band, for instance). Examples of 
bandpass signals are shown in (b) and (c). The spectral extent, denoted 



(c) 


Figure A.5-1. Signal spectrums: (a) Lowpass or baseband, (b) bandpass, and 
(c) absolutely bandlimited bandpass. 


A.6 Linear Networks 
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by W f , is shown in (a) and (b) for typical lowpass and bandpass signals. 
A waveform having a spectrum that is truly zero outside a band, as shown 
in (c), is truly bandlimited, and its absolute bandwidth is the same as its 
spectral extent. 

In the case where /(f) is periodic, its spectrum’s magnitude consists 
of impulses. Calculation of bandwidth remains the same except the envelope 
of the spectrum’s magnitude (locus of tips of positive impulses) is used. 

Mean Frequency and RMS Bandwidth 

Let F(o>) be the spectrum of a signal /(f) (lowpass or bandpass). We 
define mean frequency eu 0 by 


and rms bandwidth by 


f cu|F(to )| 2 da) 

-'o 

f |F(oj )| 2 day 
•Jo 

/ («) - <5o) 2 |FM| 2 day 

-*0 

f |F(ci >)| 2 day 
•Jo 


(A.5-1) 


(A. 5-2) 


for bandpass signals. For lowpass waveforms, (A.5-2) applies if w 0 is set 
to zero. 


A.6 LINEAR NETWORKS 
Impulse Response 

A network, having an output port that responds to a signal as excitation 
at an input port,t is linear if superposition applies. A unit impulse S(f - t) 
applied to a linear network causes a response that is characteristic of the 
network, called its impulse response h(t, r). The network’s response g(l) 
to an arbitrary input /(f) is given by 


f(r)h(t, t) dr. 


(A. 6-1) 


If the network is also time invariant such that h(t, t ) no longer depends in 
form on the time of occurrence of the input impulse, then 

/i(f, t) = h(t - t) (A. 6-2) 


t We consider only single-input, single-output networks. 
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and 

g(‘) = f f(r)h(t - t) dr. (A.6-3) 

J — oo 

This expression applies generally to both energy and power signals. 

T ransfer Function 

By Fourier transformation of (A.6-3) we have 

G(co) = F(oi)H((ii), (A. 6-4) 

where G(co), F(co), and H( at) are the respective transforms of g(t), f(t), and 
h{t). H(ai) is called the transfer function of the linear time invariant network. 
Equation (A.6-4) applies to all signals (energy or power) for which F(ai) 
can be defined. 

Bandwidth 

Bandwidths of a network (3-dB or rms) can be defined in exactly the 
same way as for waveforms. It is necessary only to use |ff(a»)| in place of 
|F(co)| used earlier in Sec. A.5. 


1 H( oj) i or 0 (cj) 
1 








(a) 


1 H(o>) 

1 orfl(co) 




s 

\ 

N. 






(b) 


Ideal Networks 

It is often desirable to approximate the transfer function of a real filter 
by an ideal filter. We define an ideal filter as one that has unity gain and 
a perfectly flat (constant) transfer function magnitude at frequencies inside 
a desired interval and zero response at all other frequencies. This definition 
is visualized by use of Fig. A. 6-1, which illustrates lowpass, highpass, and 
bandpass ideal filters. In each case the filter may have a linear phase 6(w) = 
-cor where r is the filter’s delay constant. 

Energy and Power Spectrum* of Response 

If the input f(t) to a linear time invariant network is an energy signal 
with energy density spectrum %/(o) = |F(w)| 2 , the energy density of the 
response g(t), denoted by £ g {co), is 

W = \G(<of = £/(o)\H((o)\ 2 (A. 6-5) 

from (A.6-4). Total energy E g in the response becomes 

Eg = ^ j £f((ij)\H((o)\ 2 dco. (A. 6-6) 

If/M is a power signal with power density spectrum Sf/co), the power 
spectrum Sf/at) of g(t) is given by 

*>) = . 9 /( 0 ) \H<(0)\ 2 . (A. 6-7) 



i 

H(co) \ or fl(o>) 





i 

/ 

/ 

/ 

/ 

/ 

i 


1 

^ 1 H(<o) | 



(c) 

Figure A.6-1. Ideal filter transfer functions: (a) Lowpass, (b) highpass, and (c) 
bandpass [3], 

Average output power P g in g(t) becomes 

= WltfMP dw. (A. 6-8) 


REFERENCES 

[1] Thomas, J. B., An Introduction to Statistical Communication Theory, John 
Wiley & Sons, Inc., New York, 1969. 

[2] Peebles, Jr., Peyton Z., Probability, Random Variables, and Random Signal 
Principles, McGraw-Hill Book Co., Inc., New York, 1980. 


388 


Review of Deterministic Signals and Networks Appendix A 


Appendix A Problems 


389 


[3] Peebles, Jr., Peyton Z., Communication System Principles, Addison-Wesley 
Publishing Co., Inc., Reading, Massachusetts, 1976. (Figure A. 6-1 has been 
adapted.) 

[4] Korn, G. A. and Korn, T. M., Mathematical Handbook for Scientists and 
Engineers, McGraw-Hill Book Co., Inc., New York, 1961. 


PROBLEMS 


A-l. Characterize the following signals as either power or energy type. 

(a) f(t) = 3 cos(lOf) (d) f(t) = 3 exp(- 2 t)u(t) 

(b) f(t) = 1.7 exp[- 6f 2 ]cos(3f) (e) fit) = 12 rect ^ 

cos(6r) + 9 cos 2 (3f) 


(c) fit) 


1 + 3r 


(f) fit) = 2 3 rect[4(r - n)). 


A-2. Find the Fourier transform of the waveforms of Prob. A-l(d). 

A-3. Find the Fourier transforms of the waveforms of Prob. A-l(e) and (f). 

A-4. Prove (A. 1-5) and (A. 1-6). 

A-5. Prove (A. 1-8). 

A-6. With a a real number, show that the scaling property of Fourier transforms, 
as given by 



with fit) <-> Fiat), is true. 


A-7. For two functions /(f) and F( a>) that are a Fourier transform pair, show the 
validity of the duality property 


Fit)*-*2Trfi-a>) 

of Fourier transforms. 

A-8. A complex waveform /(f) has a Fourier transform F(o>), Find the spectrum 
of /*(f) in terms of Flo). 

A-9. A waveform /(f) is differentiated in time n times. Find the spectrum of 
<d"f!t)/dt " in terms of Fico), the spectrum of fit). 

A-10. Assume pairs 1 1 and 20 of Appendix G are true and use them to prove pair 

12 . 


A-ll. Use (A. 1-9) to find the (auto) correlation function of the rectangular pulse 
/(f) = 3 rect (f/5). 

A-12. Work Prob. A-ll for the waveform /(f) = A u(t)expi-bt), where A and b 
are positive real constants. Plot your result. 

A-13. For the waveform of Prob. A-12: (a) Find the energy density spectrum and 
the waveform’s energy from the energy density, (b) Find the signal’s energy 
from the middle form of (A. 1-15) to verify the result of (a). 

A-14. Find the spectrum of rect(f/T) and verify that (A. 2-1) is valid. 

A-15. Work Prob. A-14 for the triangular waveform of (A.2-3). 


A-16. Find the Fourier transform of /(f) = Arect(f/r)cos( 7 rf/r) and show that it 
is the sum of two sampling functions by means of the frequency shifting 
property of Fourier transforms. 

A-17. By use of the time average defined in (A.3-1) find the dc component and 
average power in the signal /(f) = AcosW), where a> 0 is a constant. 
A-18. Work Prob. A-17 for the signal /(f) = Acos 4 (&>„f). 

Use (A. 3-7) and (A. 3-8) to find the time autocorrelation function and power 
spectrum of the signal of Prob. A-17. 

A-20. Find (a) the trigonometric form and (b) the complex form of the Fourier 
series of the waveform of Prob. A-l(f). 

*A-21. Use Fourier series to prove that 


2 8(f - nT p ) < — ► f sfa, 

»* *p n — -oo \ 

where T p is a positive constant. 

*A-22. Use the result of Prob. A-21 to show that 


f)' 


and 



JL 2 e 1 " 1 ” 11 * = 2 bit - nT„). 

1 p «= -oo -oo 

A-23. If /(f) <-» Flo), show that the Fourier series coefficients C„ of (A.4-7) that 
define the complex Fourier series of the periodic signal / p (f) = 2“ 
fit - nT p ) are given by 



where T„ is the period of fit) and /(f) is nonzero only on ( - TJ2 < t < 

V 2). 

A-24. Use (A. 4-8) to show that 8 (f - t) has “area” of one for any value of r. 

A-25. If the argument of a unit impulse function is itself a function git), then it 
can be represented by 


8[g(f)] = 


8(f 

*o) 

dgit) 

dt 

t = h 


where f 0 is the value of f for which g(f) = 0, that is, f 0 is a root of g(t) = 
0. Use this result to evaluate Bit) defined by 


Bit) = 



4>lt)&(at - b) dt 


where a and b are real constants and <i>(f) is arbitrary except it is continuous 
at the time of occurrence of the impulse. 
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A-26. 


A-27. 


A-28. 


A-29. 


A-30. 

A-31. 

A-32. 

A-33. 

A-34. 


A-35. 


*A-36. 


Find the Fourier transform of the signal 

f(t) = [A + / m (r)]cos(<u 0 r) 

where A and oi a are constants and fjt) has the spectrum F m i<o). 

Determine the value of k in each of the following signals such that if r — * 
0, then fit) -* bit) in the limit. (Hint: Use tables to find the area in (a), 
(b), and (f), and assume r > 0.) 

(a) fit) = k exp^- (d) fit) = k exp^ - 

(b) fit) = k sJA (t) fit) = k rect 


(c) fit) = k tri( - ) 


(f) fit) = k Sa 2 (-) 

W 

Evaluate the following integrals: 


W 

(a) J~ cos(9;)S(r - 2) dt 

(C) _ 

f Sit - 3)cxp(-8r 2 ) dt 

(b) J S it + 2)(1 + 4/ + 8ri) dt 

(d) 

/ 8(0(1 + r 2 )' 1 dt. 

* — oo 


A periodic signal fit) is constructed by forming replicas of the waveform 
/ 0 (r) = a/ 2 rect[(2r - t)/2t] every nT p , n = 0, ±1, ±2, .... where a is a 
constant and T p > t. Find (a) the Fourier series and (b) the average power 
in fit). 

Find the 3-dB bandwidth of the signal fit) = n(r)exp(-af) where a > 0 is 
a constant. 

Work Prob. A-30 for the signal fit) = t 2 u(t)cxp( - at). 

Find the rms bandwidth of the waveform of Prob. A-30. Find the energy 
in fit). 

Find the rms bandwidth and energy of the waveform of Prob. A-31. 

A network’s transfer function is approximately given by H(co) = 1 + jot. 
An input baseband signal fit), having average power P f and rms bandwidth 
W /rms , generates a response git). Find the average power P g in git) in terms 
of Pf and W / rmt . 

A network has transfer function H(o>) = 10(1 +j5<i>)~'. (a) Find the network’s 
impulse response hit), (b) Use convolution to obtain the network’s response 
to the input signal fit) = 12w(/)exp(- f/10). 

Find (a) the energy density spectrum and (b) the energy in the network’s 
response of Prob. A-35. 



Appendix B 

Review of 
Random Signal 
Theory 


B.O INTRODUCTION 

In this appendix we give a brief review of the basic elements of random 
signal theory that are needed in the main text. We use the words random 
signal to imply either a desired random waveform (usually referred to as 
the signal ) or an undesired random waveform such as noise. For additional 
detail the reader is referred to some of the recent literature [1-9], 

Our review begins with the fundamental concept on which all else is 
based, probability. 


B.1 SAMPLE SPACES, EVENTS, AND PROBABILITY 

Sample Spaces 

An experiment for which the outcomes of a trial are random in their 
occurrence is called a random experiment. An example of a random ex- 
periment would be the drawing of a card from a thoroughly shuffled deck 
of 52 cards. The trial is the actual drawing of a card and the card itself is 
the outcome; there are 52 possible outcomes in this experiment. The set 
of all possible outcomes in a given random experiment is called the sample 
space and the outcomes are elements of the sample space. In the card- 
drawing experiment the sample space is called discrete because only discrete 
outcomes are possible. 

Different experiments may have different sample spaces. Consider a 
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circular potentiometer having a 360° tap rotation capability that generates 
any voltage from 0 (at zero-degree position) to, for instance, 15 V (at 360° 
position). An experiment of randomly placing the tap now produces an 
outcome (voltage) that can have any value of a continuum of values from 
zero to 15 V. The sample space of this experiment is called continuous. 

i 

Event* 

An event is defined as a subset of the sample space. In the card- 
draw experiment we may be more interested in getting a king than in any 
one card. Here four elements of the sample space satisfy (make up) the 
event (subset). 

Probability 

Probability is a function of the events defined on a sample space. It 
is a number assigned to each event that defines the relative likelihood that 
the event can occur. The assignment may be made on the basis of common 
sense, from the results of measurements, or any reasonable and justifiable 
basis. For example, it is reasonable in the card-draw experiment that the 
event “draw a king” will have a probability A because there are 4 kings 
and 52 total cards and all cards are presumed equally likely to be selected. 

If events are denoted by capital letters (and elements of the events 
by lowercase letters) such as A, B, C, and so on, we denote the probability 
of an event (such as A) by P{A). 

Joint Probability 

We denote the probability that two events, A and B, will jointly or 
simultaneously occur on a trial of an experiment by P(A, B). For example, 
if a box is filled with 90 thoroughly mixed resistors, as shown in Table 
B.l-1, and if A is the event “draw a 100-ft resistor” and B is the event 
“draw a 2-W resistor,” then P(A, B) = & = 0.1. Note that P(A) = U 
and P(B) = $$ in this example. 


TABLE B.l-1. Resistors in a Box by Resistance and Wattage. 


Resistance 

Wattage 

} w 

1 W 

2 W 

Totals 

10 n 

8 

6 

5 

19 

100 n 

10 

13 

9 

32 

iooo a 

18 

15 

6 

39 

Totals 

36 

34 

20 

90 
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For several events A,, A 2 , ..., As, their joint probability is denoted 
by P{A u A 2 A s ). 

Conditional Probability 

Sometimes we are interested in the probability that an event, such as 
A, occurs, given that some other event, such as B, has occurred. We call 
this conditional probability and denote it by P(A\B). In the example defined 

in Table B.l-1, we have / > (A|U) = fa. Joint and conditional probabilities 
are related by 

P(A, B) = P(A\B)P(B), (B.l-l) 

or, alternatively, 

P(A, B) = P(B\A)P{A). (B.l-2) 

By combining (B.l-2) and (B.l-1), we obtain one form of Bayes’ rule 

P(A\B)P(B) = P(B\A)P(A). (B.l-3) 

Statistical Independence 

Two events A and B are said to be statistically independent if 

P{A\B) = P{A), independent events. (B.l-4) 

An equivalent definition is 

P(A, B) = P(A)P(B), independent events. (B.l-5) 
In the preceding experiment of drawing a resistor from the box of 90 
resistors we have P(A) = ft and P(A\B) = fa so P(A\B) fa P(A) which 
means events A and B are not independent. 

.. „ Multiple events A,, A 2 , A N are said to be statistically independent 
if all the conditions 

P(A it Aj) = P(A,)P(Aj) 

P(A , , Aj , A k ) = P(A,)P(Aj)P(A k ) fB , 


A j a n) = P(A,)P{Aj) - P(A„) 

are satisfied for 1 *i<j<k<- *N. There are 2 N - N - 1 of these 
conditions. 

B.2 RANDOM VARIABLES, DISTRIBUTIONS, 

AND DENSITIES 

In some random experiments outcomes are not numerical, such as in the 
earlier resistor and card-selection experiments. The concept of a random 
variable allows all random experiments to be represented numerically. 
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Random Variabla 

A random variable is defined as a real function of the elements of a 
sample space. Thus points (elements) of a sample space map into points 
(numbers) on the real line through the function. Capital letters (IV, X, or 
T) are used to represent a random variable, and a particular value of the 
random variable is denoted by lowercase letters ( w , x, or y). 

Random variables can be discrete, continuous, or a mixture of the 
two, depending on the sample space and the form of the function defining 
the random variable. 

Distribution Functions 


Let A be the set of points in the sample space that maps into the set 
of real-line points { X =£ x}. These points correspond to all values of the 
random variable X that do not exceed an arbitrary number x. The probability 
of the set A defined on the sample space must equal the probability of the 
set {X x} due to the direct point-to-point mapping involved. This probability 
is a function of x, which we denote by P x (x), t that is given by 


Px(x) = P{X =£ x}. (B.2-1) 

The function P x (x) is called the cumulative probability distribution function 
of X. It is a function having the following properties: 

Px(~°°) = 0 (B.2-2a) 

Px( °°) = 1.0 (B.2-2b) 

0 =£ P x (x) « 1.0 (B.2-2c) 

Px(x,) P x (x 2 ) if x, < x 2 (B.2-2d) 

P{x t < X *£ x 2 j = P x (x 2 ) - P x (x,). (B.2-2e) 


For a discrete random variable, P x (x) will contain steps with amplitudes 
equal to the probabilities of the possible discrete values of X, denoted by 
x lt i = 1, 2, ..., N: 

Px(x) = 'EP{X = x,\u(x - x,), (B.2-3) 

I — 1 

where we define the unit-step function by 



x 0 
x < 0. 


(B.2-4) 


When more than one random variable is involved, the joint probability 
distribution function is defined by 


p x„x 2 x N (x u x 2 , ...,x N ) = P{X { x,, Z 2 =s x 2 , X N =s x N }. (B.2-5) 


t The subscript X indicates PAx) applies to the random variable X. 
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Probability Density Functions 

The probability density function, denoted by p x (x), is defined as the 
derivative of the distribution 


Px(x) = 


dP x {x) 


The properties exhibited by p x (x) aret 

0 « Px(x) 

f p x (x) dx = 1 

— oo 

Px(x) = f Px({) d( 
PAx t <X^x 2 }= f Px ( x) dx. 

J X\ 

For a discrete random variable, 


Px(x) = 2 P{% = x,}8(x - x,). 


(B.2-6) 


(B.2-7a) 


(B.2-7b) 


(B.2-7c) 


(B.2-7d) 


(B.2-8) 


For several random variables the joint probability density function is 

V . ». \ ^ P X , Xj Xn(.X\,X 2 , . . . , X\r ) 

PxuXt x N (Xi,x 2 , . . . , x N ) - - — : . (B .2-9) 

dx, dx 2 dx N 


Conditional Distribution and Density 

By again letting event A = {X s x} and using (B.l-1), we have the 
probability of {X =s x} conditional on some event B having occurred. We 
call this probability the conditional distribution of X, denoted by P x (x\B): 

P x (x\B) = P{X =£ x\B) = (B.2-10) 

Conditional density follows the derivative: 

„ rdm _ dP x(x\B) 


(B.2-10) 


Px(x\B) = 


(B.2-1 1) 


When many random variables, X„ X 2 , ..., X N , are to be considered. 


f Properties (B.2-7a) and (B.2-7b) together are sufficient to guarantee that a function 
PAx) is a valid density function. 
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we extend (B.2-10) and (B.2-11) to obtain 


P X , **(*1, — 


P{X t S x it ..., X N s x N , B} 


d N P x x N ( x \ i ■ • ■ i x,v|B) 


(B.2-12) 


Px t (*. x n \B) = . (B.2-13) 

dX j * * * dXpj 

In the case where event B in (B.2-10) is defined by a second random 
variable Y according to B = {y - Ay <T*£ y + Ay}, with Ay — ► 0 such 
that Y approaches a fixed value y, it can be shown thatt 

P x (x |y) = lim P{X « x\y - Ay < F y + Ay} 

Av— *0 


Px, y(i, y)d£ 


(B.2-14) 




(B.2-15) 


Statistical Independence 

N random variables X,, i = 1 , 2, .... N, are said to be statistically 
independent if their joint distributions (or densities) satisfy the conditions 
of (B.l-6). One particular condition, 


P Xl .Xl X N (•*!> x 2t •••> X n) — 0 P X (*/)> 


(B.2-16) 


proves extremely useful in the analysis of many practical systems. 


B.3 STATISTICAL AVERAGES 


Average of a Function of Random Variables 

Let g(X u X 2 , ..., X N ) be a real function of the N random variables 
X,, i = 1, 2, ..., N. We define the statistical average (also called the 
expected value or mean value ) of g(- , •••, •)» denoted either by g or 
ElgO, ')] by 

1= E[g{X u X 2 ,...,X N )} (B.3-1) 


■CL 


g(xi , x 2 , . . . , x N )p Xu x 2 x N (x, , x 2 x N ) dx t dx 2 . . . dx N . 


t If Y is a discrete random variable, some care must be exercised in using these results. 
See [3, pp. 86-89], 
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Most practical problems require only two random variables, say X and Y. 
In this special case 


g = E[g{X , Y)] 


-n: 


g(x, y)PxA x > y) dx dy. (B.3-2) 


Moments 

The statistical averages of a number of special functions g(X , Y) are 
especially important. These averages are called moments of the random 
variables X and Y. For example, 


X = E[X] = 


xp x (x) dx 


is called the mean or first moment of X (about the origin), and 
T 2 = E[X 2 ]= f x 2 p x (x)dx 


(B.3-3) 


(B.3-4) 


is the power in X (second moment about origin). Another moment (about 
the mean X) is called the variance of X; it is denoted by errand given by 


cr 2 x = E[(X - X) 2 ] 


(x - X) 2 px(x) dx. 


(B.3-5) 


Other important moments are called the correlation and the covariance of 
X and Y and are denoted by R xr and C XY , respectively. These quantities 
are defined by 


R xr = XY = E[XY] 


=£1 


xypx. y(x,y) dx dy (B.3-6) 


C X y = (.X- X)(Y - Y) = E[(X - X)(Y - Y)] 


■a: 


(x - X)(y - Y)p x ,r(.x, y) dx dy. 


Interrelationships of importance are 
crl = X 2 - X 2 


Cry - XY - X Y = Rxy — X Y. 


(B.3-7) 

(B.3-8) 

(B.3-9) 


B.4 GAUSSIAN RANDOM VARIABLES 

A random variable X is called Gaussian if its probability density function 
is given by 


Px(x) = (277-0-*) ,/2 exp 


( x - Xf 
2 cr 2 x 


(B.4-1) 
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This function is plotted in Fig. B.4-1. It is a symmetric function about the 
mean of A" and its spread is proportional to cr x (called the standard deviation 
of X). The normalized Gaussian density results when X = 0 and <r* = 
1. Let the notation p(x ) denote the normalized case. The normalized 


distribution function is then 


P(x) = J~ p(x) dx 

(B.4-2) 

with 


p(x) = (277)“ 1/2 expf- yY 

(B.4-3) 


It is readily shown that the distribution function of the normalized Gaussian 
random variable is related to P x ( x) by 

PAx) = fjAt) d t = p(~^)- (B.4-4) 

The error function, complementary error function, and Q function are 
defined from the normalized Gaussian density by 

erfOS) 4-1= f'e-Pdi, erf(-/3) = -erf(/3) (B.4-5) 

\TT Jo 

erfc(/3) = 1 - erf(/3), erfc(-/3) = 2 - erfc(/3) (B.4-6) 

2(/ 3) = -L f e-^df, G(-/3) = 1 - S(/3). (B .4-7) 

y/2n Ap 



Figure B.4-1. The Gaussian probability density function. 


These are related to the distribution function by 


Pxix) = 1 



= 1 - - erfc 

1 1 - 
= - + - erf 
2 2 


LZ 

\V2 (t X/ 
x - X 

V2 


<?x. 


(B.4-8) 


B.5 RANDOM SIGNALS AND PROCESSES 

The purpose of all the preceding theory, of course, is to provide the basis 
for describing random waveforms. To effectively use the basis, we must 
now introduce the concept of modeling a real random signal by a real 
random process. t 

Random Process Concept 

Consider a real random (noise) waveform, such as shown in Fig. B.5-1 
as x 0 (r), that might exist at a given point in some system. Another system, 
built identically to the first, might generate a different random waveform 
at its similar given point as shown in Fig. B.5-1 as x x (t). Still other systems, 
all built identically, would generate other waveforms, such as x 2 (t), xft), 
and so on. In principle we can imagine an infinity of such identical systems, 
all generating their own waveforms. The collection of these waveforms is 
called an ensemble. The waveforms are different with time, because the 
thermal agitation of electrons in conductors and semiconductors in the 
systems that give rise to the noises is different from system to system, 
even though the systems are built the same. 

Another way of viewing the waveforms of Fig. B.5-1 is to imagine 
them to be the possible waveforms that could have been generated by one 
network. This approach is especially attractive. The ensemble of all possible 
waveforms is called a random process. The actual random signal is only 
one ensemble member of the random process of possible signals that, taken 
together, describe the statistical properties of the signal. For example, at 
time tj the voltage of any one waveform is treated as a specific value of a 
random variable describing all the waveform values of the members of the 
process. The mean value of this random variable is a statistical average 
of all possible voltages that could occur at time t, ; it is called the ensemble 
average at time 

If the random process is denoted by X(t), then the process is interpreted 
as a random variable X at time t. The probability density is denoted by 

t We consider only real processes. However, it is also possible to define complex 
processes [3, 4]. 
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Figure B.5-1. An ensemble of random waveforms x,(t), i = 0, ±1, ±2, . . . 
that comprise a random process. 


p x (x\ t ). The average random signal amplitude (dc value) at time t then 
becomes 


E[X(t)] 


-r 

^ — o 


xp x {x\ t ) dx. 


(B.5-1) 


If Px(x\ t) varies with time, then E\X(tj\ may vary with time. By a similar 
logic we define the power in the process by 

E[X 2 (t)) = j x 1 p x (x\ t) dx (B.5-2) 

which may, in general, vary with time. 

Correlation Function* 


PAxi, x 2 ; 1 1 , t 2 ), the correlation between X, and X 2 , denoted by R X x0i , 
t 2 ), is 

x l x 2 Px(x,,x 2 -t u t 2 )dx l dx 2 . 

(B.5-3) 

Because the correlation is of two random variables taken from the same 
process, it is called the autocorrelation function. With f, = t and t 2 = 
t + r, where r = t 2 — f, is the difference in times, we have 


RxxUu h) = E[X(t,)X(t 2 j] 


jj: 


RxxU, t + r) = E{X(t)X(t + r)]. (B.5-4) 

If two random processes X(t) and Y(t) are considered, the correlation 
between random variables X = X(t) and Y = Y(t + t) is called the cross- 
correlation function of the processes. It is given by 


Rxr(t, t + r) = E[X(t)Y(t + t)]. (B.5-5) 

Alternatively, 

Rrx(t, t + r) = E[ Y(t)X(t + r)]. (B.5-6) 


Stationarity 

The mean, power, and autocorrelation functions are all measures of 
the statistical properties of a process. Many other measures are also possible 
(higher moments). Broadly speaking, if the process’s statistical properties 
do not vary with time, it is called stationary. Although there are many 
precise definitions of stationarity, one of the most broadly useful is wide- 
sense stationarity. A random process X(t) is wide-sense stationary if two 
conditions are true: 

E{X(t)\ = X = constant (B.5-7a) 

E[X(t)X(t + t)] = R xx (.t) (independent of t). (B.5-7b) 

Two processes arc jointly wide-sense stationary if they are separately 
wide-sense stationary and their cross correlation functions are independent 
of absolute time 

E[X(t)Y(t + t)] = R xy (t) (B.5-8) 

E[Y{t)X(t + t)] = R yx (t). (B.5-9) 

An ergodic process is a more restricted form of random process. It 
is one in which the time averages of a single ensemble member of the 
process are equal to the corresponding statistical averages. If time averages 
are defined by 

^[•] = li mf T [ [•] dt 

r~*°° Zl J -j 


If random variables defined from the process X{i) at times f, and t 2 
are A'j = Z(t,) and X 2 = X(t 2 ) with joint probability density denoted by 


(B.5-10) 
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as usual, then for the ergodic process 

= E[X(t)] = rf[x(t)] 4 x _ (B.5-11) 

* 2 = £[Z 2 «] = ^U 2 (/)] 4 *2 (B.5-12) 

Rxx(r) = E[X(t)X(t + t)] = d[x{t)x(t + r)] = SkJj), (B.5-13) 
where we use 5? to represent a time-correlation function. 

For jointly ergodic processes 

Rxy(t) = ^xy(r) = s4[x{t)y(t + r)] (B.5-14) 

Ry x(j) = %x(t) = s4[y(t)x(t + t)]. (B.5-15) 

Although statistical averages are most often used in modeling and 
theoretical analysis of systems, it is the time averages that are most readily 
measured (approximately) in practice. If the process is ergodic, the two 
sets of results are equal and the measured results will agree with the theory. 
In practice, processes are therefore often assumed ergodic without actual 
proof of the assumption.? However, even if the process is not ergodic the 
validity of the assumption is of little consequence if the theoretical and 
practical results agree within reasonable bounds, as they often do. 


B.6 POWER DENSITY SPECTRUMS 

The preceding section discussed ways of describing random processes using 
the time domain. As with deterministic signals, random signals may be 
described in the frequency domain. The description involves a power density 
spectrum instead of a voltage density spectrum, however. 

Stationary Processes 


If X T (t) represents a random process X(t) truncated to -7s ( s I 
(and zero for |t| > T), it can be shown that the process has a power density 
spectrum 


y. xA w ) — lim 

T—+OQ 


£[|*rM 2 ] 

2 T 


(B.6-1) 


where X T (w) is the Fourier transform of X T (t) and the process is assumed 
to be at least wide-sense stationary.? The power spectrum is a real, non- 
negative, even function of ca for real processes. 

For jointly wide-sense stationary processes, X(t) and Y(t), having trun- 
cated representations X T (t) and Y T (t) with respective Fourier transforms 


t Proof that a process is ergodic is difficult. 

X The use of the capita! X in both X T (o>) and X r (l) does not imply the same function 
with different arguments here. It is to be hoped that the reader will be able to resolve this 
minor clash of notation through context. 
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X T {u>) and Y T (a>), cross-power density spectrums may be derived as follows: 

„ , , E[Xf(w)Y T (w)] 

SVM = hm — (B.6-2) 

7--+O0 Zi 

, ,. £[T?(o>)Z r (ca)] 

SV(w) = hm — . (B.6-3) 

T— eoo Zi 


Power and cross-power density spectrums of jointly wide-sense sta- 
tionary processes are related to correlation functions as Fourier transform 
pairs 


Rxx(t) *- 

y xx (w) 

(B.6-4) 

Ryy(t) *— 

y yy(° j ) 

(B.6-5) 

Rxy(t) *- 

y xy(w) 

(B .6-6) 

Ryx(t) 

—* y yx(w)- 

(B.6-7) 


Nonstationary Processes 

Even if a random process is not stationary, an average power density 
spectrum can be defined as the Fourier transform of the time average of 
the process autocorrelation function 

?xx(.<o)= J m X xit,t + r)]e-^dr. (B.6-8) 

mJ -so 

Thus 

m x AY t + r)] = ^ J y x Aco)e Jm dco. (B.6-9) 

In the nonstationary case the autocorrelation function cannot be recovered 
from its average power spectrum; only the time-averaged autocorrelation 
function is recoverable. 

In a similar manner, average cross-power density spectrums can be 
defined as the Fourier transforms of the time-averaged cross-correlation 
functions. 


Power 


Power in a random process X(t) derives either from its autocorrelation 
function or its power density spectrum: 


X 2 = E[X 2 (t)] = RxA 0) 



y xxiw) dco. 


(B.6-10) 
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B.7 RANDOM SIGNAL RESPONSE OF NETWORKS 
Fundamental Result 

A random signal x(t ) applied to the input of a single-input, single- 
output linear time-invariant network will generate an output, or response, 
given by the convolution of x{t) with the network’s impulse response h(t): 


y(t) = f h(?)x(t - i) d{ 

•J — oo 

= J x(t)h(t - 0 df. 


(B.7-1) 


The fundamental result (B.7-1) is all that is required to derive all characteristics 
of the random response y(t). 

Since each waveform of a random process X(t) is affected by the 
network through (B.7-1), the responses correspond to members of a new 
random process Y(t). As a consequence of this fact we can interpret (B.7-1) 
as a transformation of one process to another 


Y{t)= f h(€)X(t - f ) d{ 
= f X(i)h(t ~ O df. 


(B.7-2) 


Output Correlation Functions 

If X(t) is wide-sense stationary and the network is linear and time 
invariant, it can be shown that the response process Y(t) is wide-sense 
stationary and X(t) and Y(t) are jointly wide-sense stationary. We state 
results for only this case. 

The mean, power, and autocorrelation function of the response are 


R yy (t) 


Y = E[Y(t)]=X f h(f)d{ (B.7-3) 

•J — oo 

/* oo /*oo 

' 2 = E[Y\t)\ = / / R x Mx- (B.7-4) 

— oo •J — oo 

/~0O /*(X 

r) = J J RxAr + d{ t d{ 2 . (B.7-5) 


Y 2 = E\Y 2 (t)\ 


(B.7-5) 
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Cross-correlation functions are 

Rxrir) = f R X A r ~ £)h(t) d£ (B.7-6) 

RyJt) = f R x At - Oh(- Z)d£. (B.7-7) 

J — CO 

Power Density Spectrums 

Let a random process X(t), that is at least wide-sense stationary with 
power density spectrum £Y xx (o)), be applied to the input of a linear time- 
invariant network having a transfer function H((o ). t The response random 
process Y(t) will have a power density spectrum 

&yy(<o) = y xx (a>)\H(aj)\ 2 . (B.7-8) 

Cross-power density spectrums are related to as follows: 

y’xyito) = y xA(d)H{(ti) (B .7-9) 

SY y A(o) = y x A<d)H( — (o). (B.7-10) 


B.8 BANDPASS RANDOM PROCESSES 


Next we assume X(t) is a bandlimited wide-sense stationary, zero-mean 
real random process having a power density spectrum defined by 


V x A<o) 1 1 J; 


0 <t &)| C |oj| < 0>1 + By- 
elsewhere. 


(B.8-1) 


Here cv, is a constant and W f is the spectral extent of the process. In 
practice, random signals are rarely bandlimited. However, there are always 
frequencies between which the power spectrum is largest and outside which 
it is negligible. This means that the assumed power spectrum form is 
reasonable for most practical problems. 

The random process X(t) can be represented in the very useful form 
X{t) = X c (t)cos(oj 0 ! + ff 0 ) - X s (t)sin(oj 0 t + 6 0 ), (B.8-2) 

where co 0 and 8 0 are constants; X c (t) and X s (t) are lowpass random processes 
with the following properties [3]: 

(1) X c (t) and -Y,(/) are jointly wide-sense stationary (B.8-3a) 


(2) E[X c (e)l = 0, E[X s (t)} = 0 (B.8-3b) 

(3) E[Xlit)} = E{A,(t)} = E[X\t)\ (B.8-3c) 


t We assume the impulse response h(t) = is a real function. 
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(4) Rx.xff) - R XcX ff) ~ ~ I Sf xx (oi)cos[loi - o> 0 )t] dot (B.8-3d) 

(5) Rx.x/j) = - RxcxM = — I 5^ x ^co)sin[(oj - a> 0 )T]dw (B.8-3e) 

IT J 0 

(6) SfxcxX 0 *) — ’xpeX w ) — L p [y xx lot - oj 0 ) + !F xx (ot + o) 0 )] (B.8-3f) 

(7) £f xjiX 0 *) ~ ~ 3'x'X , •,(«>) = ~ JLpWx^to ~ "o) _ y’xxk* + «>o)]. 

(B.8-3g) 

Here ot 0 is any convenient frequency in the band o>, < &>„ < at, + W f and 
Lp[‘] represents taking the lowpass portion of the quantity in brackets. 

One of the most useful applications of (B.8-2) is when Sfjofat) is 
symmetric about &) 0 . In this case, R XcX ,lr) = 0. Rx,x,(t) = 0, Sf XcXl lat) = 
0 and SF XsXl (.ot) = 0. 


B.9 MATCHED FILTERS 

In a number of systems a deterministic signal fit) is transmitted and received, 
along with noise nit) at the receiver. The receiver’s problem is often to 
decide, at some instant in time t 0 , whether the signal and noise or just 
noise is present. Its ability to make this decision is enhanced if the ratio 
of the signal’s power at time t 0 to average noise power is large. In fact, 
it is possible to select a specific filter, called a matched filter, that will make 
this signal-to-noise ratio maximum. We examine such filters in this section. 

Colored Noise Case 


Consider an arbitrary linear time-invariant filter with impulse response 
and transfer function h(t) and //(co), respectively. The response f 0 (t) generated 
by the application of f(t) to the filter’s input is 



F{o>)H{a>)e iM dot 


(B.9-1) 


where 


/« <— >• Flat). (B.9-2) 

Average output noise power N 0 is 

N o = ^f_ ^ NN (ot)\H(ot)\ 2 dot (B.9-3) 

where SV/co) is the power density spectrum of the noise n(t) represented 
by the random process Nit). 

The ratio of signal power at time t 0 to average noise power, which 


B.9 Matched Filters 
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we wish to maximize, is 


By applying Schwarz's inequality A we have 


(£)/_ F(a>)H(a>)e*«° d<o 

Q;) f dot 

hequalityf we have 

± fM,,. 

Nj 2 tt if NN (ot) 


(B .9-4) 


(B .9-5) 


The maximum (equality) occurs only when we choose the optimum transfer 
function as 


H op ,(ot) = 


2trC r/ NN (ot) 


(B.9-6) 


This filter is called matched because its transfer function depends on the 
signal's spectrum Flat). If the signal is changed, the optimum filter must 
change; it, therefore, must be matched to the signal used. 


White Noise Cese 

In the special case of white noise where Hf NN lot) = NJ2 (constant) 
at all frequencies, the optimum filter transfer function is 

H op ,(ot) = —^—F*(ot)e- JM \ (B.9-7) 

7 T JSqC 

Its impulse response is readily found to be 

h op , it) = —TTFf*(to ~ t). (B.9-8) 

7T Jl 0 C 

If fit) is real, this optimum impulse response is a replica of fit) centered 
at t 0 but “running backwards” in time. 
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PROBLEMS 

B-l. In a classroom there are 87 students; 43 have blond hair, 29 have black hair, 
and 15 have brown hair. The instructor randomly selects a student to answer 
a question, (a) For this random experiment define a trial, the outcomes, 
and sample space, (b) What is the probability the selected student’s hair is 
brown? (c) What is the probability the selected student’s hair will be blond? 

B-2. Two cards are drawn randomly from a thoroughly shuffled ordinary deck 
of 52 cards. What is the probability that both cards will be aces? 

*B-3. Extend Prob. B-2 to the drawing of four cards that are all aces. 

B-4. A random experiment consists of drawing a resistor from a box of resistors 
as defined in Table B. 1-1. Define events A = “draw a 10-fl resistor,” B = 
“draw a 1000-ft resistor,” and C = “draw a 1-W resistor.” Find the 
probabilities: (a) P(A), (b) P(B), (c) P(C), (d) P(A, B), (e) P(A, C), (f) P(A\B), 
(g) P(A\C), (h) P(C\A), (i) P(A, B , C), and (j) P(A\B, C). 

B-5. Are the events A, B, and C of Prob. B-4 statistically independent in any 
sense, that is, in pairs or as a triple? 

B-6. A random experiment consists of throwing two dice and observing the sum 
of the two numbers that show up. (a) Define a sample space for this experiment, 
(b) Assign probabilities to the elements of the sample space and justify the 
choices, (c) What are the probabilities of the events A = “sum is 1,” 
B = “sum is 6,” and C = “sum is 12”? 


B-7. In a box there are 10 green, 8 red, 16 white, 15 blue, and 3 black balls. A 
ball is randomly selected, (a) What is the probability that the ball is red? 
(b) What are the probabilities for the other colors? 

B-8. A production line manufacturing gunpowder can explode if two control 
circuits A and B jointly fail. Their probabilities of failure are known to be 
0.001 and 0.004, respectively. It is also known that P(B\A) = 0.006. 
(a) What is the probability of the line exploding? (b) What is P(A|fl)? (c) 
Are events A and B statistically independent? 

B-9. An experiment successfully teaches a rat to select a particular color door 
to open and receive food. The rat is repeatedly placed before nine doors 
from which it chooses one. Each door is painted on its left and right halves 
with one of the three colors red (R), yellow (T), and blue (B). After many 
trials it is found that the rat chooses the various doors according to the 
probabilities of Table PB-9. (a) What is the probability the rat will choose 
a door with red on it? What are the same probabilities for yellow and blue? 
To what color was the rat taught to respond? (b) Find the probabilities that 
the rat chooses pairs of colors. That is, find probabilities P(R, V), P(R, B), 
and P(Y, B). What color does the rat seem to prefer most as a second color 
choice? 


TABLE PB-9. Probabilities of Door Selections by a Rat. 



Door Right Side Color 

R 

Y 

B 

Door 

Left 

Side 

Color 

R 

0.05 

0.12 

0.10 

Y 

0.15 

0.20 

0.08 

B 

0.13 

0.10 

0.07 


B-10. Define equations that must be true if three events A,, A 2 , and A, are to be 
statistically independent. 

B-ll. The sample space, denoted by the set S, in a random experiment is S = 
{1, 2, 3.5, 6}. For each random variable X defined, find the set of values 
X may take on. Here s,, i = 1 , 2, 3, 4 are the elements of 5, referred to 
generally by s. 


(a) X = 6s 


(b) X = 12j 3 - 2r 


(d) X = 

(e) X = 

(f) X = 



“I 


B-12. A sample space contains all numbers s defined by 2 =s s =s 5. Rework Prob. 
B-ll using this new sample space. 
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B-13. A random variable X is discrete and has the possible values of the set {1, 
3, 5, 9, 13}. The respective probabilities of the values form the set {0.05, 
0.15, 0.25, 0.40, 0.15}. (a) Plot the distribution function of A", (b) Plot the 
density function, (c) Find the probability P{X *£ 7.0}. 

B-14. The random variable of Prob. B-13 is converted to a new random variable 
Y according to Y = 2X - 6. (a) Plot the distribution function of Y. 
(b) Plot the density function of Y. 

★B-15. A random variable X is transformed to a new random variable Y through a 
transformation Y = T(X). Let T(-) be either a monotonically increasing or 
decreasing continuous function and let AT be a continuous random variable 
with probability density function pAx). Show that the density function p Y (y) 
is given by 


B-14). 


Pv(y) = pAx) 


dx 

dy 


where x = T '(y) and 7”'(-) is the inverse of T( ). 

If a continuous random variable X is transformed to a new random variable 
Y = aX + b, a and b being real constants, use the results of Prob. B-15 
to show that the density of Y is 


Pr(y) = Px 


'y-b\ 1 

/W 

B-17. A random variable X is called uniformly distributed on (a, b), with b > a 
if 


r 1 

7 , a < x <b 

pAx) = \ b ~ a 

[0, elsewhere. 

(a) Find all moments, denoted by m„, defined by 

m n = E[X"], n = 0,1,2,.... 

(b) Find the variance of X. 

B-18. The uniform random variable of Prob. B-17 is transformed to a new random 
variable Y = CX 3 where C > 0 is a constant, (a) Find and plot the density 
function of Y. (b) For a > 0 find and plot the distribution function of Y. 
B-19. (a) Find the distribution function of the uniformly distributed random vari- 
able of Prob. B-17. (b) Let x , = 0.9a + OAb and x 2 = 0.2a + 0.8 b. Find 
P{*, < X x 2 }. 

B-20. Determine which of the following functions are not valid density functions 
and state reasons why. 


(a) q(x) = (b)u(x)exp(-x/6) 

(b) q(x) = u(x + 6) - u(x - 10) 

(c) q(x) = [u(x + 1) - u(x - l)](i)cos( 7 rjc). 
B-21. Work Prob. B-20 for the given functions. 

(a) q(x) = -2u(x)exp( ~x 2 /4) 

(b) q(x) = (2n)' in expi-x 1 ) 

(c) q(x ) = 12(1 + x ! )~ 2 . 


B-22. The Laplace density as defined by 

pAx) = ae~ ]x - mVi 

is a valid density function for arbitrary real constants m and b if a is chosen 
properly. Find: (a) the proper value of a, (b) the mean, and (c) the variance 
of the random variable X. 

B-23. The exponential random variable X has a density 

pAx) = (£)««ex P (-f) 

for b > 0 a real constant. Find: (a) the mean, (b) the variance, (c) the 
distribution function of X, and (d) the probability that the random variable 
can have values larger than its mean. 

B-24. The Rayleigh random variable X is defined by the density 

PAx) = (y)«Wexp(-£) 

for b > 0 a real constant. Find: (a) the mean, (b) the variance, (c) the 
distribution function of X, (d) the mode of X, defined as the value of X at 
which pAx) reaches a maximum, and (e) the probability that X can have 
values exceeding the mode. 

B-25. A Rayleigh random variable X has b = 4 (see Prob. B-24). Find P{X =s 3}. 
B-26. Two statistically independent Rayleigh random variables X and Fare defined 
as in Prob. B-24 with b equal to 2 and 4, respectively. Find P{1 5 < A - 
1.0 < Y =£ 2.0}. 

B-27. The joint density function of random variables X and Y is 


Px.rix, y) 



(a) Find P{Y « X/2). (b) Find P{Y « 2X). 

*B-28. The density function of a single random variable can be obtained by integrating 
out all other random variables in a joint density function. The resulting 
density is called the marginal density function. Thus for two random variables 
X and Y, 


PAx) = I Px.r(x,y)dy 

•J — oo 

Pr(y) = J~ Px.r(x,y)dx. 


Use these results to find the marginal densities of X and Y when 
Px.r(x,y) = u(x)u(y)xtxp(- x - xy). 

By application of (B.2-15), also find the conditional densities pAx\y) and 
Pr(y\x). 

B-29. Find (a) the mean of X, E[X ], (b) the mean of Y, E[Y], and (c) the correlation 
of A" and Y, R X y = E[XY] for random variables X and Y defined by the joint 
density in Prob. B-28. 


412 


Review of Random Signal Theory Appendix B 


B-30. A Gaussian random variable X has mean X = 2 and variance ai = 1 44 
Find P{ 0.32 < X s 2.84}. 

B-31. What is the probability that the magnitude of a zero-mean Gaussian random 
variable can have values larger than twice its standard deviation? 

B-32. A zero-mean Gaussian random variable X with variance 4 is changed to a 
new variable Y according to Y = 8 + (X/2). (a) Find and plot the density 
function of Y. (b) What are the mean and variance of Y2 (c) Find £LY 2 1 
and £[F 2 ]. 

*B-33. Let X and Y be statistically independent random variables with respective 
densities pufx) and p Y (y). Show that the density of the sum W = X + Y 
is given by 



Px(w ~ y)Pr(y) dy. 


C Hint : Equate probabilities P w (w) = P{W =s w} = P{X + Y =s w}. ) 

B-34. Show that the first right-hand-side equation in (B.4-8) is true. 

B-35. Show that the random process XU) = A cos(co x r + <f> x + Q x ) is wide-sense 
stationary, where A, <j> x , and a> x are real constants; B x is a random variable 
uniform on [-n, if). 

B-36. Work Prob. B-35 except for the process Y(t) = B cos (a> Y t + <j> r + B x ). 

B-37. (a) Find the cross-correlation function E[X(t)Y(t + t)] of the random processes 
defined in Probs. B-35 and B-36. (b) What conditions can be placed on the 
various constants such that X(t) and Y(t) are jointly wide-sense stationary? 

B-38. Show that (B.5-11) and (B.5-13) are true for the random process of Prob 
B-35. 

B-39. Find the autocorrelation function and power spectrum of the process of 
Prob. B-35. 

B-40. The power spectrum of a process X(t) is 

y xx (m) = ^ 

i + (w 2 / loo)' 


(a) Use (B.6-10) to find the power in the process, (b) Find the process’s 
autocorrelation function, (c) Verify the power in X(t) by the relationship 

£[* 2 «] = R xx ( 0). 

B-41. Find the power in processes having the following power spectrums: 


(a) y xx i(o) — 

(b) if xx(co) — 


12 

(10 4 + oj 2 ) 2 
100 (u 2 
(100 + a, 2 ) 2 


(O ifxxiu) = 8 exp 


(d) if xx (o>) — 


CO 2 / 10 

(10 + co 2 ) 3 ' 
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B-42. Assume the cross-power spectrum of two processes is defined by 

f a + 17 /> — W < co < W 

^xr(a)) = \ W 

[0, elsewhere, 

where W > 0 and a is a real constant. Find the crosscorrelation function 
Rxy(t)- 

B-43. Define a process X(l) as in Prob. B-35 except let 0* be uniform on [0, tt/ 2] ; 
it becomes a nonstationary process, (a) Find the autocorrelation function 
of X(t). (b) Find the time average of the autocorrelation function, (c) Find 
the power spectrum of X(t). 

B-44. White noise with autocorrelation function (A 0 /2)S(r) is applied to a lowpass 
filter for which H(ai) — [1 + /(cv/10)] -1 . (a) Find the filter’s impulse response, 
(b) Find the autocorrelation function of the output noise process Y(t) by use 
of (B.7-5). (c) Use (B.7-6) to find R xy (t). 

B-45. Show that (B.7-3) is equivalent to 

T = XH( 0) 

where h(i) <-* H(a>). 

B-46. Begin with (B.7-2) and derive (B.7-5) and (B.7-6). 

B-47. Show that (B.7-5) can be written in the form 


)= / RxrU + mO di 

— oo 

= f RrxB - WO df. 

mJ _ oo 


B-48. By Fourier transformation of R yy (t), using (B.7-5), show that (B.7-8) is true. 
B-49. Fourier transform the crosscorrelation functions of (B.7-6) and (B.7-7) to 
show that (B.7-9) and (B.7-10) result. 

B-50. If H( w) is the transfer function of a network, its noise bandwidth W N (rad/s) 
is defined by 


f l#M 2 


where cu 0 is the midband frequency (rad/s). For a lowpass network w 0 
0. Find the noise bandwidth for the network for which 


H(l0) [1 + (w/W) 2 ] 2 

with a and W > 0 constants. 

B-51. If white noise is applied to a linear network, show that the output noise 
power is the same as if the network were replaced by an ideal filter (rec- 
tangularly shaped |/f(w)|) with the same midband gain and bandwidth equal 
to the network’s noise bandwidth. 
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B-52. The power spectrum of a bandpass random process X(t) is 
it xx ( oj ) = P{u(at - ti) 0 + W L ) — u(co - o) 0 - W H ) 

+ u(oj + <x> 0 + - **(*• + <»o - W/.)], 

where P, o> 0 , and W H are constants, W L + W H = IV, and io 0 > W. 
(a) Plot it xx (m) . (b) Find and plot the power and cross-power density spectrums 
of X c {t) and X,(t) used in the representation of (B.8-2). (c) What is the largest 
bandwidth that X c (t) and X,(t) may have if IV is to be kept constant and 
what must W L and W H be for this bandwidth to occur? (d) Repeat part (c) 
to find the smallest bandwidth. 

B-53. Find the crosscorrelation function R XcXi (t) for the process defined in Prob. 
B-52. 

*B-S4. Let a wide-sense stationary random process X(t) be applied to one side of 
a product device. The other input to the product is the signal Acosftoof + 
d 0 ) where A, a> 0 and 0 O are constants. Show that the power spectrum of the 
output Y(t) of the product device is 

o- 

y rr (o)) = — [itjufui - cd 0 ) + it + aid)] 

where it xx (<o ) is the power spectrum of X(t). 

B-55. Derive (B.9-5) from (B.9-4) using Schwarz’s inequality. 

B-56. A rectangular pulse of duration r 0 defined by 

/(r) = ArectM^‘ 

J 0 

is applied, along with white noise of power density JfJ 2 to a matched filter, 
(a) Find H V M if 0) = 1 is required, (b) Find MO- 
B-57. Work Prob. B-56 for the triangular pulse 

T(f ~ r.)l 

f(t) = Atn . 

. T » J 

B-58. For a signal fit) in white noise with power density A 0 /2, show that (B.9-5) 
can also be written as 

B-59. Show that (B.9-8) is true. 


Appendix C 

Trigonometric 

Identities 


cos(x ± y) = cos(x)cos(y) + sin(x)sin(y) (C-l) 

sin(x ± y) = sin(x)cos( y) ± cos(x)sin(y) (C-2) 

cos^x + = + sin(x) (C-3) 

sin^x ±~^ = ± cos(x) (C-4) 

cos(2x) = cos 2 (x) - sin 2 (x) (C-5) 

sin(2x) = 2 sin(x)cos(x) (C-6) 

2 cos(x)cos(y) = cos(x - y) + cos(x + y) (C-7) 

2 sin(x)sin(y) = cos(x — y) — cos(x + y) (C-8) 

2 sin(x)cos(y) = sin(x - y) + sin(x + y) (C-9) 

2cos 2 (x) = 1 + cos(2x) (C-10) 

2 sin 2 (x) = 1 - cos(2x) (C-l 1) 

4 cos 3 (x) = 3 cos(x) + cos(3x) (C-l 2) 

4 sin 3 (x) = 3 sin(x) - sin(3x) (C-l 3) 

8 cos 4 (x) = 3 + 4 cos(2x) + cos(4x) (C-l 4) 

8 sin 4 (x) = 3-4 cos(2x) + cos(4x) (C-l 5) 
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where 


A cos(x) - B sin(.v) = R cos(x + 6) (C-16) 

R = a/A 2 + B 2 (C-17) 

6 = tan ~\B/A) (C-18) 

A = R cos(0) (C-19) 

B = R sin(0) (C-20) 

2 cos(;t) = e* + e~ ix (C-21) 

V sin(jr) = e* - e~ lx (C-22) 


Appendix D 

Useful Integrals 
and Series 



(D-l) 

(D-2) 

(D-3) 

(D-4) 

(D-5) 

(D-6) 

(D-7) 
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(D-23) 

(D-24) 

(D-25) 

(D-26) 

(D-27) 

(D-28) 

(D-29) 


(D-30) 


Appendix E 

Gaussian (Normal) 
Probability Density 
and Distribution 
Functions 

For zero mean and unit variance: 


P(x) = — 7 = e ~* 2/2 
yin 

(E-l) 

P(x) = -4= f e~* n du 

yh r J -on 

(E-2a) 

P(- x) = 1 - P(x) 

(E-2b) 
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TABLE E-l. The Gaussian or Normal Density and Distribution. 
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Appendix F 

Table of Error 
Functions 


The complementary error function is defined by 

erfc(x) = 1 - erf(x) (F-l) 

where erf(x) is the error function given by 


For negative x 




e~ (2 d{. 


erf(-jc) = -erf(x) 
erfc(-x) = 1 + erf(x) = 2 - erfc(x). 
For x 3= 2 the approximation 


(F-2) 


(F-3) 

(F-4) 


erfc(x) == —f=, x » 1 (F-5) 

xyn 

has 10.5% or less error. Equations (F-l), (F-2), and (F-5) are tabulated. 


X 

erf (x) 

erfc (x) 

e-' 2 

x\frr 

0.0 

0.0 

1.0 


0.1 

0.1125 

0.8875 


0.2 

0.2227 

0.7773 


0.3 

0.3286 

0.6714 


0.4 

0.4284 

0.5716 


0.5 

0.5205 

0.4795 


0.6 

0.6039 

0.3961 


0.7 

0.6778 

0.3222 


0.8 

0.7421 

0.2579 


0.9 

0.7969 

0.2031 


1.0 

0.8427 

0.1573 

0.2076 

1.1 

0.8802 

0.1198 

0.1529 

1.2 

0.9103 

0.8969 (10“ ! ) 

0.1114 

1.3 

0.9340 

0.6599 (10- ') 

0.8008 (lO' ') 

1.4 

0.9523 

0.4771 (10-') 

0.5676 (10“') 

1.5 

0.9661 

0.3389 (10-') 

0.3964 (10“ ') 

1.6 

0.9763 

0.2365 (10-') 

0.2726 (10“') 

1.7 

0.9838 

0.1621 (10"') 

0.1844 (10“') 

1.8 

0.9891 

0.1091 (10-') 

0.1228 (10"') 

1.9 

0.9928 

0.7210 (10" ! ) 

0.8033 (10“ 2 ) 

2.0 

0.9953 

0.4678 (10- 2 ) 

0.5167 (10“ 2 ) 

2.1 

0.9970 

0.2979 (10- 2 ) 

0.3266 (10- 2 ) 

2.2 

0.9981 

0.1863 ( 1 0 - 2 ) 

0.2028 (10“ 2 ) 

2.3 

0.9989 

0.1143 (10- 2 ) 

0.1237 (10“ 2 ) 

2.4 

0.9993 

0.6885 (lO" 3 ) 

0.7408 (10“ 3 ) 

2.5 

0.9996 

0.4070 (lO’ 5 ) 

0.4357 (10“ 3 ) 

2.6 

0.9998 

0.2360 (10- J ) 

0.2515 (lO" 3 ) 

2.7 

0.9999 

0.1343 (10-’) 

0.1426 (10"’) 

2.8 

0.9999 

0.7501 (lO - *) 

0.7932 (lO" 4 ) 

2.9 

1.0000 

0.4110 (10-*) 

0.4331 (10- 4 ) 

3.0 

1.0000 

0.2209 (lO - *) 

0.2321 (10" 4 ) 

3.5 

1.0000 

0.7431 (10-‘) 

0.7713 (10“ 6 ) 

4.0 

1.0000 

0.1542 (lO' 7 ) 

0.1587 (10“ 7 ) 

5.0 

1.0000 

0.1564 (10-") 

0.1567 (10“") 
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Appendix G 


Table of Useful 
Fourier Transform 
Pairs 


In the accompanying table the functions /(•), /„(•), F„(-) and F(-) may be 

complex. 

Constants f 0 , co 0 , a, r 

, W, and <t are real, whereas a n may be 

complex. 

Special functions are defined as follows: 


«(0 = j 

[l, f>0 

' 0, { < 0 


rect(|) = 

r l, |f| < 1/2 

.0, |f| > 1/2 


tri(f) = | 

V A 
1 

^ © 


Sa(fl = i 

: 'sin(f) 


sgn(^) = • 

f 1, f>0 

l-i. f<o. 

Pair Number f(t) 

F{<o) Notes 

1 

/.(O 

FJM 

2 

2 “/.(d 

n- 1 

2 «/■.(«) 

3 

/(f - f«) 

F(&i)e~ y "° 

4 


F((D - <D 0 ) 

5 

fiat ) 

iM:) 
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6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 


t > 0 
W > 0 
T > 0 
W > 0 


22 

cos((u 0 r) 

7r[8((D — CD 0 ) + 8((D + (Do)] 


23 

sin(cu 0 r) 

-yir[8((D - (d 0 ) - 8(&> + (d 0 )] 


24 

«(f) e -“ 

1 

a + j<o 

<2 > 0 

25 

u(t)te~°' 

1 

(a +jcd) 2 

22 > 0 

26 

u(t)t z e~" 

2 

(a +JW) 1 

22 > 0 

27 

uit)^e~°' 

6 

(a + 7 (d)* 

22 > 0 

28 

C -.W 

la 

a 1 + a , 2 

22 > 0 

29 

e -|2/»r2) 

o-v5w e-^f 2 

cr > 0 
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Absolute bandwidth, 385 
Adaptive delta modulation (ADM), 142, 
202, 216 

feedback control in, 216 
forward control in, 216 
hybrid configurations, 223-24 
instantaneous step-size control, 218 
noise performance, 220-23 
syllabic step-size control, 219 
A/D conversion (see Analog-to-digital 
converter) 

A-law compressor, 78 
Aliasing, 17 

error, mean-squared, 17 
fractional error, 18 
ALOHA system, 130 
slotted, 130 

Amplitude shift keying (ASK), 300 
bandwidth, 250 

bit error probability, 249-50, 304 
coherent, 247-51 
local carrier generation in, 251 
Af-ary, 346-49 
noncoherent, 300-304 
OOK version, 247 
threshold, 302 
word error probability, 348 
Analog message, 4 
Analog signal, 4 
Analog source, 4 
Analog-to-digital converter, 6, 58 
Antipodal signals, 155 
Aperture, 45 

A posteriori probabilities, 147, 340 
A priori probabilities, 148 


ASK ( see Amplitude shift keying) 
Asynchronous multiplexer, 120 
AT&T, 121 

Autocorrelation function 
of deterministic signals, 381 
of periodic signals, 383 
of random signals, 401, 404 
AWGN channel, 8 


Bandlimited white noise, 144, 235, 292 
Bandpass random process, 405-6 
Bandpass signal, 5 
Bandwidth 
absolute, 385 
efficiency, 178n 
of networks, 386 
noise, 413 
rms, 205, 241, 385 
three-dB, 384 
Baseband signals, 5 
Baseband system, 9 
Basic DM, 198 
Basis function set, 336 
complete, 337n 
Baudot code, 3 
Bayes’ rule, 393 
Bessel functions, 295 
approximation for, 304 
Binary digit (see Bit) 

Binit, defined, 78n 
Binomial coefficient, 89 
Biorthogonal signal sets, 362 
Bi-phase format, 106 
Bi-phase-mark format, 106 


i 
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Bi-phase-space format, 106 
Bipolar format, 105-6, 1 17-19 
Bit 

binary digit, 7, 78 
control, 124 
as information unit, 82 
least significant, 79 
most significant, 79 
Bit error probability 
in coherent ASK, 249-50 
in coherent binary systems, 152-55 
in coherent FSK, 264-65 
in CPFSK, 267 

in differentially detected systems, 315 
in DPSK, 312 
in duobinary system, 191 
in MSK, 288-90 
in noncoherent ASK, 304 
in noncoherent FSK, 306 
in OQPSK, 260 
in PCM, 157-62 
in PSK, 254 
in QPSK, 256, 258, 364 
Bit interleaving, 120 
Bit stuffing, 124 
Block codes, 88 
decoding of, 94 
Bursts, 129 


CCITT, 124 

CFDM (see Constant factor delta 
modulation) 

Channel, 6 
AWGN, 8 
bandlimited, 186 
encoding, 87 
Channel bank, 121 
Clock, 49 
Codec, 124 
Code(s), 2 
Baudot, 3 
binary, 2 
block, 88 
check digits, 88 
constraint length, 88 
convolutional, 88 
defined, 78 
efficiency of, 83 
error correction, 89 
folded binary, 80 
formatting of, 102 
gain, 88, 164 
generator matrix, 92 
Golay, 164 
Gray, 81 
group, 88 
Hamming, 90 
Huffman, 83-84 
inverted folded binary, 80 
length, 2, 7n 
A/-ary, 2, 81 
memoryless, 88 
Morse, 3 

natural binary, 79 


parity check matrix, 90 
perfect, 90 
rate, 88 

redundancy of, 7, 88 
size, 7n 
syndrome, 91 
systematic, 90 
tail, 99 
tree, 88 
Collision, 130 
Compandor, 72 
Complementary error function 
approximation, 422 
defined, 398, 422 
tabulated, 423 
Compressors, 72 
A-law, 78 
/x-law, 74-75 

Conditional probability, 393 
Conditional probability density, 395-96 
Conditional probability distribution, 395— 
96 

Constant factor delta modulation (CFDM), 
218 

Continuously-variable slope DM 
(CVSDM), 219 

Continuous phase FSK (CPFSK), 266 
Control bits, 124 
Convolution, 377 
Convolutional code, 88 
decoding of, 98-102 
state diagrams, 94 
tree diagrams, 94 
trellis diagrams, 94 
Correlation 
functions, 400-402 
of random variables, 397 
Correlation receiver 
in coherent binary systems, 149-50 
in M-ary systems, 343-44 
in noncoherent binary systems, 295-97 
Costas loop, 261 
Covariance, 397 

CPFSK (see Continuous phase FSK) 

Crest factor, 64n, 165 
Cross correlation functions, 401, 402, 405, 
406 

Cross power density spectrum, 403, 405, 
406 

Cumulative probability distribution 
function (see Probability 
distribution function) 

CVSDM (see Continuously-variable slope 
DM) 


D/A conversion (see Digital-to-analog 
converter) 

Data encryption, 57n 
Decision regions, 341 
Delay constant, 386 
Delay modulation, 112 
Delta function (see Impulse function) 
Delta modulation (DM), 142, 198 
channel bandwidth, 202 
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Delta modulation (DM) (coni.) 
compared to PCM, 21 1—12 
with double integration, 203, 206 
granular error, 200 
noise performance, 203-11 
slope overload error, 200 
transient period, 200 
Delta PCM, 224 

Delta-sigma modulation (D-SM), 142, 202, 
212 

channel bandwidth, 213 
noise performance, 213-15 
Dibit, 257n 

Differential decoding, 161 
of codes, 107-8 
of polar waveforms, 279-80 
Differential detection 
of FSK, 313-17 
of MSK, 313-17 
of PSK, 306-13 
Differential encoding, 161 
of codes, 107-8 
of polar waveforms, 279-80 
Differential PSK, 306-13 
bit error probability, 312 
Differential pulse code modulation 
(DPCM), 224 
adaptive, 230 
channel bandwidth, 226 
noise performance, 227-29 
step sizes, 224 

Differential waveform (format), 107 
Digicom system, 1 
Digital 
message, 5 
signal, 5 
source, 5 

Digital hierarchy, 122-24 
Digital multiplexing (see Multiplexing, 
digital) 

Digital-to-analog converter, 8 
DM (see Delta modulation) 

Double sine FSK, 290 
DPCM (see Differential pulse code 
modulation) 

DPSK (see Differential PSK) 

DSFSK ( see Double sine FSK) 

D-SM (see Delta-sigma modulation) 
Duobinary 
precoding, 110 
system, 1 86—87 

waveform (format), 107-11, 119 
Duplex 
full, 268 
half, 268 
Dynamic range 
extensions, 216 
factor, 222 


Encoding 

channel, 87-102 
differential, 107-8, 161, 279-80 
source, 78-87 

Energy density spectrum, 378 


of network response, 386 
Energy signal, 376 
Ensemble, 399 
Entropy of a source, 83 
Equalizers 
adaptive, 186 
preset, 186 
transversal, 182 
Equalizing Alter 
in flat-top sampling, 26, 29, 31 
for ISI control, 182 
in sample-holds, 31, 33 
transversal, 182 

Equivalent rectangular pulse, 28 
Ergodic random process, 401 
Error correcting codes (see Codes) 
Error-free performance, 355 
Error function 
defined, 398, 422-23 
tabulated, 423 

Error probability (see particular system) 
Event, 392 
Expandor, 72 
Expectation, statistical 
correlation, 397 
of a function, 396-97 
mean, 397 
moments, 410 

of random process, 400-402, 404 
of random variables, 397 
variance, 397 
Experiment, 391 
Exponential density, 411 


Fast frequency shift keying (FFSK), 279 
FFSK (see Fast frequency shift keying) 
Filter 
ideal, 386 

matched, 143, 406-7 
First-order sample-hold, 32 
Fourier series 
complex, 382, 389 
trigonometric, 382 
Fourier transforms 
defined, 377 
of periodic signals, 383 
properties of, 377-78, 388 
table of, 424-25 
Frame, 48, 120 
synchronization, 49, 128 
Frequency multiplexing, 124 
Frequency shift keying (FSK) 
bandwidth, 265 

bit error probability, 264-65, 267, 306 
coherent, 262-70 
continuous phase, 266 
discontinuous, 262 
with independent oscillators, 262 
M-ary, 359-60 
noncoherent, 304-6 
power spectrum, 265, 268-70 
threshold, 264 
word error probability, 360 
FSK (see Frequency shift keying) 


Gamma density, 67 
Gaiussian 

density, 146, 293, 340, 397, 420-21 
distribution, 398, 420-21 
generalized distribution, 68 
message, 67 
Generator matrix, 92 
Golay code, 164 

Gram-Schmidt procedure, 333, 334-36, 
338 

Granular noise (error) 
in ADM, 221 
in DM, 200 
in DPCM, 227 
in D-SM, 213-14 
Group code, 88 
Guard time, 48 


Hamming 

codes, 90 
distance, 94, 99 
minimum distance, 139 
Hard-decision decoding, 99n 
HCDM (see Hybrid companding delta 
modulation) 

Hierarchies, 122-24 
High-density bipolar waveform, 106 
Hilbert transform 
defined, 37n 
product property, 55 
in sampling, 37-38 
Huffman code,’ 83-84 
Hybrid companding delta modulation 
(HCDM), 223 


Ideal filters, 386 
Impulse function, 140 
defined, 382, 390 
with functional argument, 389 
Impulse response, 385 
Information, defined, 82-83 
Interleaving, 120 
Interpolating function, 14, 44 
canonical, 47 

Intersymbol interference (ISI), 162, 171 
ISI (see Intersymbol interference) 


Joint probability 

defined, 392 
density function, 395 
distribution function, 394 


Laplace density, 67, 74, 411 
Leibniz’s rule, 66, 194 
Logarithmic companding, 74 


Manchester format, 106-7, 119 
MAP receiver (see Maximum a posteriori 
receiver) 


Markov models, 87 

M-ary amplitude shift keying (M-ASK), 
346-49 

signal constellation, 346 
word error probability, 348 
M- ary frequency shift keying (M-FSK), 
359-60 

symbol error probability, 360 
M-ary phase shift keying (M-PSK), 349- 52 
signal constellation, 350 
word error probability, 351 
M-ary pulse amplitude modulation (M- 
PAM) 

thresholds in, 194 
word error probability, 194-96 
M-ary waveform, 112-14 
M-ASK (see M-ary amplitude shift 
keying) 

Master clock, 120 
Matched filter, 143, 406-7 
in binary receivers, 149-50, 249, 297- 
300 

in coherent ASK, 249 
for colored noise, 406-7 
in DM, 200 
in DPSK, 308 

in M-ary receivers, 195, 345-46 
in MSK, 284 
in noncoherent ASK, 300 
in noncoherent FSK, 305 
for white noise, 407 
Maximum a posteriori (MAP) receiver, 

340 

Maximum likelihood decoding, 99 
Maximum likelihood receiver, 340 
Mean frequency, 385 
Mercer’s theorem, 373 
M-FSK (see M-ary frequency shift keying) 
Miller waveform (format), 112, 120 
Minimum shift keying (MSK), 267, 271-91 
bit error probability, 288-89 
with differential encoding, 279, 281 
other implementations, 290-91 
power spectrum, 287 
serial, 282-87, 288-90 
type I, 274-79 
type II, 281 

M-level pulse position modulation (M- 
PPM), 360-62 

symbol error probability, 361 
Modem, 121, 244 
AT&T model 103. 268 
full-duplex, 268 
half-duplex, 268 
simplex mode, 268 
Modified duobinary 
system, 192 

waveform (format), 111-12, 119 
Modulation index, 219, 291 
Modulator, 102 
Moments, 410 

M-PAM (see M-ary pulse amplitude 
modulation) 

M-PPM (see M-level pulse position 
modulation) 


T 
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M-PSK (see M- ary phase shift keying) 
MSK (see Minimum shift keying) 
Mu-law compressor, 74-75 
Multiframe, 121 

synchronization, 128 
Multiplexers, 120, 121, 122, 124 
Multiplexing 
asynchronous, 120 
bit interleaved, 120 
bit stuffing in, 124 
bursts in, 129 


digital, 120 
frame in, 48 
guard time in, 48 
hierarchies, 122-24 
master clock, 120 
multiframe in, 121 
postamble, 130 
preamble, 130 
quadrature, 259 
superframe in, 122 
synchronization, 49 
synchronous, 120 

time division, 12, 47—49, 120-28, 130 
time division multiple access (TDMA), 


128-30 

time slot in, 48 
tributaries in, 120 
word interleaved, 120 
MUX, 120 


Nonretum-to-zero (NRZ) format, 104 

Norm, 334 

NRZ- mark, 107 

NRZ-space, 107 

NTT, 121 

Nyquist rate, 12, 35 

Nyquist’s vestigial-symmetry theorem, 172 


Offset quadrature phase shift keying 
(OQPSK), 259-60 
One-out-of-two format, 106 
On-off keying (OOK), 247, 300 
OOK (see On-off keying) 

Optical telegraph links, 3 
Optimization criterion, 143, 147-48 
Optimum systems 
binary coherent, 148-55 
binary noncoherent, 291-300 
Af-ary, 338-46 

OQPSK (see Offset quadrature phase shift 
keying) 

Orthogonal signals (functions), 15, 264, 
331, 352-53 

Orthonormal basis functions, 332 
Orthonormal signals, 331 
Overload 
amplitude, 58, 203 
slope, 200, 212-13, 215-16, 225 


Packets, 130 

Parallel-to-serial converter, 120, 258 
Parity check 
bit, 89 
matrix, 90 

Parseval’s theorem, 15, 378 
Partial response, 176 
generalized, 179-81 
polynomial, 180 
Pattern noise, 261 
PCM ( see Pulse code modulation) 
Periodic signal 
bandwidth, 385 
defined, 382 
Fourier series, 382 
power, 383 
spectrum, 383 

Phase reversal keying (PRK), 252 
Phase shift keying (PSK), 5, 251 
bandwidth of, 252 
binary, 251-55 
bit error probability, 254 
M-ary, 349-52 
power spectrum in, 252 
signal constellation, 349-50 
threshold in, 252 
word error probability, 350-52 
Phase trellis, 272 
Polar format, 104-5, 113, 117 
Postamble, 130 
Power density spectrum, 381 
average, 403 
average cross, 403 
cross, 403 

of network response, 386 
of periodic signal, 383 
of random signal, 402 
Power signal, 376, 380 
Preamble, 130, 262 
Precoding 

in duobinary system, 110 
in modified duobinary system, 1 1 1 
PRK (see Phase reversal keying) 
Probability, 392 
conditional, 393 
joint, 392 

a posteriori, 147, 340 
a priori, 148 

Probability density function 
conditional, 395-96 
defined, 395 

of discrete random variable, 395 

exponential, 411 

gamma, 67 

Gaussian, 67, 68 

joint, 395 

Laplace, 67, 74, 411 
marginal, 411 
properties of, 395 
Rayleigh, 303, 312, 411 
Rice, 304, 312 
of a sum, 412 
uniform, 410 

Probability distribution function 
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conditional, 395-96 
defined, 394 

of discrete random variable, 394 
Gaussian, 398, 420-21 
joint, 394 
properties of, 394 
Pseudotemary waveform, 105 
PSK (see Phase shift keying) 

Pulse code modulation (PCM), 4, 155 
bit error probability, 160, 161 
with differential encoding, 161 
intersymbol interference in, 162 
noise performance, 165, 167 
threshold effect, 166-68 
threshold in, 169—70 
word error probability, 163 
Pulse stuffing (see Stuffing) 


QAM (see Quadrature amplitude 
modulation) 

Q function, 398 
Marcum’s, 317 

QPSK (see Quadrature phase shift keying) 
Quadrature amplitude modulation (QAM), 
259 

Quadrature component representation of 
noise, 310 

Quadrature modulation, 259 
Quadrature multiplexing, 259 
Quadrature phase shift keying (QPSK), 

255 

bit error probability, 256, 258, 364 
with one message source, 256-59 
with two message sources, 255-56 
Quantization, 4 
defined, 4 
error, 58, 60, 200 
mean-squared error in, 60 
Quantized PPM (see M level pulse 
position modulation) 

Quantizers 

amplitude overload in, 58 
companded, 71-78 
loading factor of, 67 
nonuniform, 58, 66 
optimum, 66 
quantum levels in, 58 
reference power level of, 59 
step size of, 58 
uniform, 58, 63, 67 


Raised cosine spectrum, 174 
Random process, 4 
bandpass, 405-6 
defined, 399 

ensemble average of, 400, 404 
ergodic, 401 
out of networks, 404 
nonstationary, 403 
stationarity of, 401-2 
time averages of, 401 
wide-sense stationary, 401 
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Random variables 
continuous, 394 
covariance of, 397 
defined, 394 
discrete, 394 
function of, 396, 397 
moments of, 410 
statistical averages of, 396-97 
transformation of, 410 
variance of, 397 
Rayleigh density, 303, 312, 411 
Receiving station, 6, 8 
Rectangular function (rect), 379, 424 
Redundancy, 7, 88 
Reference station, 130 
Remodulator, 261 
Retum-to-zero (RZ) format, 104 
Rice density, 304, 3)2 


Sample space, 391, 392 
Sampling, 6 
aperture in, 45 

of bandpass waveforms, 33-39 

direct, 34 

efficiency, 47 

flat-top, 24 

frequency-domain, 18 

generalized, 27 

with Hilbert transforms, 37 

higher-order, 40 

ideal, 16 

instantaneous, 12 
of lowpass waveforms, 12, 19 
natural, 22 
network view of, 15 
nonuniform, 12n 
quadrature, 36 
of random signals, 19, 38 
second-order, 40-43 
signal recovery in, 30 
in two dimensions, 43 
uniform, 12 

Sampling function, 14, 379, 424 
Schwarz's inequality, 154, 191, 407 
defined, 407n 

Serial-to-parallel converter, 256-57 
SFSK (see Sinusoidal FSK) 

Shape missmatch, 73 
Signal constellation, 255, 334 
of binary signals, 357 
of biorthogonal signals, 362 
of M-ASK, 346 
of M-PSK, 349-50 
of orthogonal signals, 353 
properties of, 342 

of vertices of hypercube signals, 366 
Signal space, 333 
dimension of, 336 
Signum function (sgn), 75, 424 
Simplex signal set, 355 
Sine integral, 53 
Sinusoidal FSK (SFSK), 290 
Slope crest factor, 204-5 
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Slope overload, 200, 212-13, 215-16, 225 
error, 200 
factor, 206 

Slotted ALOHA system, 130 
Soft-decision decoding, 99n 
Source 

average information of, 82 
encoding, 78, 83 
entropy, 83 
extension, 84 
probabilities, 148 
zero-memory, 83 
Spectral extent, 5, 384-85 
Spectral folding, 17 
Split-phase format, 106 
Spread spectrum, 57n 
State diagram, 94, 98 
States, 96 

Statistical averages (see Expectation, 
statistical) 

Statistical independence, 393, 396 
Step size 

control in ADM, 218-20 
in DM, 200 
in DPCM, 224 
in D-SM, 213 
in quantizer, 58 
Stuffing, 124 
Superframe, 121 
Survivor, 100 
Symbol interval, 329 
Synchronization 
of frame, 128 
of multiframe, 128 
of multiplexers, 49, 128 
Synchronous multiplexer, 120, 121 
Syndrome, 91 
Systematic code, 90 


Tables 

of complementary error function, 423 
of error function, 423 
of Fourier transforms, 424-25 
of Gaussian density, 421 
of Gaussian distribution, 421 
Tail, 99 

TDMA (see Time division multiple access) 

Ternary code, 7 

Threshold 

in coherent binary systems, 149, 236, 
245, 324, 327 
in coherent FSK, 264 
in DM, 242 „■ 
in D-SM, 242 
in duobinary system, 189 
in M-ary PAM, 194 
in noncoherent binary ASK, 302 
in PCM, 169-71 


Threshold effect in PCM, 166 
Time average, 380, 401 
Time division multiple access (TDMA), 
128 

Time division multiplexing (see 
Multiplexing) 

Time slot, 48 
Transfer function, 386 
Transient interval (period) 
in DM, 200 
of encoder, 96 
Transmitting station, 6 
Transversal equalizer, 182 
Tree diagram, 94 
Trellis diagram, 94 
Trial, 391 

Triangular function (tri), 29 
defined, 379, 424 
Tributaries, 120 
Twinned binary format, 106 


Uniform density, 410 

Uniform sampling theorem (see Sampling, 
uniform) 

Unipolar format, 103-4, 113, 115-17 
Unique word, 130 
Unit-step function, 394, 424 


Variance, 397 

Variance (power) mismatch, 73 
Vertices of hypercube signal set, 365-66 
Vestigial-symmetry theorem, 172 
Viterbi algorithm, 98 


White noise, 8, 337-38, 407 
Word interleaving, 120 
Word stuffing, 124 
Word (symbol) error probability 
in binary systems, 163-64 
with biorthogonal signals, 364 
in M-ary PAM, 195-96 
in M-ary systems, 342 
in Af-ASK, 348 
in M-FSK, 360 
in A/-PPM, 361 
in M-PSK, 350-52 
with orthogonal signals, 353-55 
in QPSK, 364 

with vertices of hypercube signals, 367 


Zero-order sample-hold, 30 


